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Abstract 


"“The  literature  on  practice  effects  and  transfer  from  single-  to  dual-task 
performance  and  part-whole  task  learning  are  briefly  reviewed.  The  results  suggest  that 
single-task  training  produces  limited  transfer  to  dual-task  performance.  Past  theoretical 
frameworks  for  multi-task  performance  are  reviewed.  A  connectlonist/control 
architecture  for  skill  acquisition  Is  presented.  The  architecture  involves  neural-like  units 
at  the  microlevel,  with  Information  transmitted  on  vectors  between  modules  at  the 
macrolevel.  The  simulation  of  the  model  exhibits  five  phases  of  skill  acquisition.  Dual¬ 
task  interference  and  performance  are  predicted  as  a  function  of  the  phase  of  practice 
the  skill  has  reached.  Seven  compensatory  activities  occur  In  the  model  during  dual-task 
training  that  do  not  appear  In  single-task  training:  l)  task  shedding,  delay  and  buffer 
pre-loading;  2)  letting  go  of  high-workload  strategies;  3)  utilizing  noncompeting 
resources;  4)  time  multiplexing;  5)  shortening  transmissions;  6)  converting  Interference 
from  concurrent  transmissions;  and  7)  chunking  transmissions.  Future  research  Issues 
suggested  by  the  architecture  Include:  Mapping  out  the  marginal  utility  of  single-  to 
multi-task  transfer:  Investigating  the  classification  of  multi-task  compensatory  activities; 
evaluating  the  role  of  part-task  trainers  for  multi-task  skills;  and  developing  and  testing 
quantitative  models  of  skill  acquisition.  (  h  '/  'I 
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Introduction 

Despite  the  fact  that  a  vast  number  of  studies  have  attempted  to  chart  the  kinds 
and  nature  of  interference  that  occurs  when  two  or  more  tasks  are  performed 
simultaneously,  little  is  known  about  the  role  of  practice  In  reducing  Interference.  The 
few  studies  which  have  investigated  practice  have  demonstrated  that  human 
performance  can  change  radically  over  time.  Moreover,  given  its  primary  Importance  In 
achieving  high  levels  of  skill,  there  have  been  few  attempts  to  provide  theoretical 
frameworks  sufficiently  detailed  to  address  Issues  such  as:  l)  why  human  performance 
becomes  more  accurate.  2)  why  It  becomes  faster,  3)  why  Interference  between  tasks 
decreases  with  practice,  and  4)  why  single-task  training  does  not  always  transfer  to  dual¬ 
task  performance.  This  paper  examines  issues  of  practice  and  workload.  First,  we 
summarize  a  number  of  issues  concerning  single-  and  dual-task  performance  and  training. 
Next  we  describe  a  runnable  simulation  architecture  used  to  investigate  a  variety  or 
single-  and  dual-task  phenomena.  Then,  we  use  this  architecture  as  a  framework  to 
clarify  a  few  central  Issues  and  problems  surrounding  the  kinds  and  nature  of  dual-task 
interference  that  occur  and  illustrate  how  it  many  change  as  a  function  of  practice.  And 
finally,  we  speculate  on  the  kind  of  research  and  modelling  needed  to  develop  more 
predictive  and  theoretically-based  approaches  to  understanding  dual-task  training  and 
performance. 


Interference  Between  Highly  Practiced  Tasks 

Bahrlck  and  Shelly  (1958)  provide  strong  support  for  the  Idea  that  even  after 
prolonged  single-task  training  the  addition  of  a  secondary  task  can  have  a  pronounced 
Impact  on  the  practiced  task.  Bahrlck  and  Shelly's  subjects  all  received  25  training 
sessions;  a  visual  serial  reaction  time  task  was  practiced  alone  on  sessions  1-2,  4-13,  15-24 
and  combined  with  a  secondary  auditory  task  on  sessions  3.  14,  and  25.  Single-task 
sessions  consisted  of  10  trials  of  100  stimuli  each,  and  dual-task  sessions  were  4  trials. 
AJthough  performance  on  the  visual  task  reached  a  relatively  high  level  of  proficiency 
within  the  first  20  trials,  when  paired  with  the  auditory  task,  performance  on  session  3 
dropped  as  little  as  15%  for  a  group  trained  on  consistent  stimuli  to  as  much  as  40%  for 
a  group  trained  on  inconsistent  stimuli.  On  sessions  14  and  25  performance  dropped 
from  about  8%  for  the  gTOup  with  most  consistent  stimuli  and  about  40%  for  the  group 
with  the  least  consistent  stimuli.  The  last  session  showed  no  additional  benefits  in  the 
dual  task  of  the  extra  10  single-task  practice  sessions.  In  other  words,  despite  substantial 
single-task  practice,  subjects  were  unable  to  Jointly  perform  the  auditory  task  without 
incurring  a  cost  on  the  practiced  visual  task-wlth  the  most  Inconsistent  visual  tasks 
suffering  the  largest  costs. 

Damos,  Bittner.  Kennedy,  and  Harbeson  (1981)  offer  further  evidence  of  the  need 
for  dual-task  practice  and  illustrate  the  time  needed  to  regain  single-task  performance 
levels  after  the  two  tasks  have  been  combined.  In  this  experiment  subjects  performed  15 
trials  (about  15  minutes)  per  day  of  two  concurrent  tracking  tasks  for  15  successive 
working  days.  One  task  required  use  of  the  left  band  and  the  other  the  right.  Prior  to 
this  study  subjects  had  received  15  sessions  of  single-task  critical  tracking,  15  sessions  of 
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compensatory  tracking,  and  15  additional  sessions  of  a  critical-compensatory  tracking 
dual  task.  In  other  words,  subjects  had  received  a  total  of  45  prior  trials  over  a  15  week 
period.  Damos  et  al.  found  that  performance  on  the  dual  task  improved  sharply  over  the 
first  four  days  of  practice  and  continued  to  Improve  throughout  the  testing  period. 
Moreover,  after  15  sessions  of  dual-task  practice,  subjects'  performance  approximated  the 
levels  obtained  after  three  single-task  practice  sessions.  This  study  provides  evidence 
suggesting  that  even  after  substantial  single-task  practice,  additional  practice  was  needed 
to  stablize  performance  when  two  tracking  tasks  had  to  be  performed  concurrently. 

Long  ago  It  was  noted  that  some  people  can  develop  the  ability  to  read  and  dictate 
simultaneously  (see  Solomons  and  Stein,  1806;  Downey  and  Anderson.  1915).  Spelke, 
Hirst,  and  Nelsser  (1976)  were  the  first  in  recent  times  however  to  document  the  effects 
of  practice  on  two  subjects’  abilities  to  read  short  stores  while  copying  auditorily 
presented  words.  Initially  the  subjects’  reading  speed  and  comprehension  scores  were 
seriously  Impaired,  yet  after  six  weeks  of  practice  their  dual-task  performance  measures 
matched  their  single-task  baseline  scores.  Near  the  end  of  the  experiment  Spelke  et  al. 
introduced  two  transfer  tasks  in  which  the  two  subjects  either  read  aloud  or  shadowed 
prose  while  copying  words  as  before.  Copying  produced  decrements  In  both  reading 
aloud  and  shadowing.  Reductions  In  Interference  were  evident  as  the  the  tasks  were 
practiced  together.  In  a  rollow-on  study.  Hirst,  Spelke,  Reaves,  Caharack.  and  Nelsser 
(1980)  replicated  their  earlier  study  and  demonstrated  further  that  with  substantial 
practice-over  50  hours-subjects  were  able  to  read  both  highly  redundant  and  less 
redundant  materials  nearly  as  quickly  and  with  comparable  comprehension  as  their 
single-task  levels.  In  a  second  experiment  subjects  learned  to  read  short  stores  and  at 
the  same  time  copy  sentences  varying  In  length  from  3  to  7  words  read  at  rates  between 
20  to  31  words  per  minute.  Following  substantial  dual-task  training,  subjects  achieved 
reading  and  comprehension  rates  similar  to  single-task  control  rates. 

Shaffer  (1975)  has  documented  one  highly  practiced  typists  (SW)  abilities  to 
perform  dual  tasks.  First,  SW  was  able  to  type  visually  presented  material  and 
concurrently  recite  nursery  rhymes  at  a  cost  of  about  io%  In  typing  speed  and  accuracy 
over  single-task  baselines.  Second,  SW  was  able  to  concurrently  type  and  shadow  prose 
presented  audltorlally  at  a  rate  of  140  words  per  minute,  again  with  a  comparable  10% 
cost  In  speed  and  accuracy.  Third,  SW  was  able  to  concurrently  type  and  shadow 
random  letters  presented  auditorily  at  a  rate  of  one  per  second  with  similar  cost. 
However,  when  SW  was  required  to  type  auditorily  presented  material  and  to  shadow 
visual  input,  her  speed  and  accuracy  suffered  greatly.  Similarly,  when  SW  was  required 
to  type  auditorily  presented  material  and  shadow  auditory  Input— a  male  voice  In  one  ear 
and  a  female  voice  In  the  other  ear-her  performance  was  markedly  Impaired. 

Ailport,  Antonis,  and  Reynolds  (1972)  recruited  piano  players  who  were  able  to 
play  by  sight  reading.  Subjects  participated  In  two  training  sessions  In  which  they 
attempted  to  shadow  auditorily  presented  prose  while  sight  reading  material  of  varying 
difficulty.  By  the  second  session  subjects'  performance  In  the  dual  task  was  comparable 
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to  single-task  baselines.  In  addition,  when  tested  for  recall  of  the  prose  materials, 
subjects'  memories  were  comparable  In  single-  and  dual-task  conditions. 

Schneider  and  Fisk  (1982)  raised  the  question  of  whether  two  visual  search  tasks 
could  be  performed  concurrently  without  a  drop  In  accuracy.  They  showed  first  that 
practice  under  single-task  consistent  mapping  (CM),  l.e.,  where  the  same  response  Is 
made  to  stimuli  across  trials,  conditions  improved  search  performance,  whereas  it  did  not 
In  Inconsistent  or  varied  mapping  (VM),  l.e.,  where  the  response  made  to  a  specific 
stimulus  varies  across  trials.  Performance  under  the  dual-task  CM  conditions  showed  a 
dramatic  Improvement  with  practice,  benefiting  the  shortest  display  durations  most. 
Performance  under  the  dual-task  VM  condition  also  Improved  somewhat,  but  did  not 
reach  single-task  \*M  performance  at  the  end  of  the  first  experiment.  A  second 
experiment  further  showed  that  subjects  were  able  to  perform  concurrent  VM  and  CM 
searchs  when  the  VM  task  was  emphasized.  While  VM  performance  did  not  result  In  a 
deficit,  CM  performance  did  drop  by  17%.  However,  when  the  CM  task  was 
emphasized.  VM  performance  dropped  precipitously  to  near  chance  levels.  Finally,  they 
also  demonstrated  that  concurrent  VM  searches  could  not  be  performed  without 
substantial  deficit  which  did  not  diminish  with  additional  practice. 

In  another  set  of  experiments  Fisk  and  Schneider  (1983)  had  subjects  perform  three 
single  tasks,  l.e.,  a  digit-span  task,  a  consistent  visual  category  search  task,  and  an 
Inconsistent  visual  category  search  task  and  two  dual-tasks,  l.e.,  the  digit  task  Joined 
with  each  of  the  two  search  tasks.  When  the  digit  and  consistent  search  tasks  were  rirst 
performed  together,  detection  accuracy  dropped  by  about  io%  from  the  single-task 
detection  baseline.  After  90  additional  trials  of  dual-task  practice,  however,  detection 
accuracy  reached  a  level  comparable  to  single-task  accuracy.  In  contrast,  when  the  digit 
and  Inconsistent  search  tasks  were  first  performed  together,  detection  accuracy  dropped 
by  about  25%  from  single-task  levels.  Moreover,  additional  dual-task  practice  did  not 
Improve  detection  accuracy.  In  contrast.  Inconsistent  tasks  show  marked  deficits  In  dual¬ 
task  conditions  and  these  deficits  decline  little  with  extended  practice.  When  single  tasks 
are  combined  for  the  first  time,  there  Is  a  substantial  decrement  In  performance  even  If 
the  tasks  are  consistent  and  extensively  trained.  Extended  dual-task  training 
substantially  Improves  consistent-task  performance  but  not  Inconsistent-task 
performance. 

The  first  session  of  dual-task  training  typically  produces  dramatic  drops  in 
performance  of  either  CM  or  VM  performance.  Schneider  and  Fisk  (1984)  examined  the 
effects  of  practice  and  transfer  In  consistent  and  Inconsistent  category  visual  search 
tasks.  Subjects  first  performed  three  single-task  searches,  l.e.,  digit  search,  consistent 
category  search,  and  Inconsistent  category  search.  Subjects  completed  1.755  single-task 
trials  (7-8  45-mlnute  sessions)  for  both  the  consistent  and  Inconsistent  tasks.  This  was 
followed  by  two  dual-task  conditions  in  which  the  digit  search  primary  task  was  paired 
with  each  of  the  semantic  search  secondary  tasks.  Hence  subjects  were  required  to 
detect  both  digits  and  category  exemplars  concurrently.  The  first  time  each  of  the  two 
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semantic  search  tasks  was  combined  with  the  digit-search  task,  the  semantic  search 
performance  dropped  substantlally~45%  for  the  consistent  task  and  49%  for  the 
Inconsistent  task-relative  to  single-task  performance  levels.  Subjects’  performance  at 
detecting  digits  also  dropped  substantlally-26%  for  the  consistent  task  and  29%  for  the 
Inconsistent  task.  Subjects'  performance  on  the  consistent  category  search  task 
Improved  with  additional  dual-task  practice,  matching  single-task  levels  after  four 
additional  sessions  and  celling  after  eight.  Subjects’  performance  on  the  Inconsistent 
category  search  task  failed  to  improve  with  additional  dual-task  practice;  the  smallest 
decrement  was  49%  and  the  largest  was  61%,  relative  to  single-task  performance. 
Finally,  It  should  be  noted  that  even  after  eight  sessions  of  dual-task  practice, 
performance  on  the  digit  task  still  incurred  an  11%  decrement  from  Its  comparable 
single-task  level  for  the  consistent  search  task  and  a  i?%  decrement  for  the  inconsistent 
search  task. 

The  results  on  CM  or  VM  practice  show  that  given  a  high  degree  of  consistent 
practice  on  a  search  task,  subjects  could  simultaneously  perform  dual  tasks  with  little 
deficit,  yet  they  needed  practice  under  dual-task  conditions  to  reach  their  single-task 
detection  levels  when  the  two  tasks  were  combined.  Further,  despite  high  degrees  of 
single-task  practice,  subjects  were  unable  to  perform  the  inconslstent/conslstent-search 
and  digit-span  tasks  without  substantial  deficit. 

Automatic  controlled  processing  theory  suggests  that  two  qualitatively  different 
forms  of  processing  can  account  for  the  marked  changes  that  can  occur  In  performance 
with  consistent  practice  (see  Schneider,  Dumals,  &  Shlffrln,  1984;  Schneider  &  Shlffrln, 
1977;  Shlffrln  &  Schneider,  1977).  Controlled  processing  Is  characterized  as  slow, 
effortful,  capacity-limited,  and  largely  under  subject  control.  Automatic  processing  Is 
characterized  as  fast,  parallel,  relatively  effortless,  and  largely  not  under  subject  control. 
The  process’ng  demands  of  most  complex  tasks  typically  reflect  neither  purely  automatic 
or  controhed  processing,  but  rather  a  mixture  of  the  two. 

A  simple  application  of  automatic/control  processing  theory  to  dual-  or  multi-task 
training  might  advocate  practicing  consistent  single-task  components  first,  prior  to 
having  the  learner  perform  the  tasks  concurrently.  In  single-task  training  components 
become  automatic,  no  longer  requiring  attention.  They  could  then  be  combined  to 
perform  dual  tasks.  However,  the  data  are  Incompatible  with  this  simple  view.  In  the 
Schneider  and  Fisk  (1984)  experiment,  both  the  consistent  and  Inconsistent  tasks 
dropped  nearly  equally  In  the  first  dual-task  block. 

Rleck.  Ogden,  and  Anderson  (1980)  raised  several  Issues  concerning  the  relative 
effectiveness  of  single-  and  dual-task  practice  on  subsequent  dual-task  performance.  In 
the  training  phase  of  their  experiment,  subjects  received  either  four  3-minute  practice 
trials  on  a  discrete  compensatory  tracking  task,  four  trials  of  the  tracking  task  combined 
with  a  digit  cancellation  task,  or  three  different  practice  combinations  of  the  two  tasks. 
Then  each  group  performed  one  3-mlnute  transfer  trial  In  which  a  discrete  tracking  task 
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was  performed  together  with  a  delayed  choice  reaction  time  task.  This  was  followed  by 
a  second  transfer  trial  In  which  8  continuous  tracking  task  was  Joined  with  a  choice 
reaction  time  task.  Rleck  et  al.  found  that  practice  on  the  single-tasks  had  little 
influence  on  subsequent  dual-task  performance.  In  contrast,  dual-task  practice  did  have 
a  decided  lnfluence-the  greater  the  prior  dual-task  practice,  the  greater  the  Influence. 
In  sum.  the  extent  of  dual-task  practice  accounted  for  28  to  35%  of  the  dual-task 
variance,  whereas  the  single-task  practice  accounted  for  only  2%.  Although  we  caution 
against  overinterpreting  results  based  on  such  small  amounts  of  practice,  we  nevertheless 
feel  they  are  suggestive  of  the  potential  utility  of  dual-task  practice,  especially  In  view  of 
the  fact  that  the  transfer  tasks  were  not  the  same  as  the  practiced  tasks. 

Several  studies  have  shown  that  performance  on  multiple  tasks  predicts  pilots’ 
performance  better  than  comparable  performance  on  single  tasks.  For  example,  North 
and  Gopher  (1976)  developed  a  test  battery  consisting  of  digit-processing,  reaction-time, 
and  one-dimensional  compensatory  tracking  tasks.  In  the  first  phase  of  their  study 
subjects  performed  the  three  tasks  separately  under  adaptive  conditions  designed  to  elicit 
maximum  baseline  measures  for  each  task.  In  the  second  phase  task  pairs  were 
performed  concurrently  with  both  tasks  given  equal  priority.  And  In  the  final  phase, 
subjects  performed  different  combinations  of  tasks  varying  In  priority.  In  general,  the 
correlations  obtained  between  the  single-  and  dual-task  conditions  were  low,  suggesting 
that  performance  on  single  tasks  Is  a  poor  predictor  or  performance  on  multiple  tasks. 
Damos  (1978)  and  Damos  and  Ltntern  (1981)  offer  further  evidence  that  multiple-task 
performance  Is  a  better  predictor  of  success  In  flight  training  than  Is  single-task 
performance. 


Part- Whole  Training 

Part-task  training  refers  to  practice  on  a  subset  of  components  comprising  a  whole 
task.  The  logic  behind  part-task  training  Is  that  learning  can  proceed  more  efficiently 
and  hardware  expenditures  can  be  reduced,  e.g.,  using  part-task  trainers  or  simulators. 
Many  whole  tasks  are  multi-task  situations.  For  example,  flying  an  aircraft  may  require 
performing  flight  control,  communication,  and  nagivatlon  tasks.  Studies  of  part-whole 
training  might  provide  Insights  with  respect  to  how  much  part-task  training  transfers  to 
multi-task  situations.  Unfortunately,  despite  the  fact  that  a  large  number  of  studies 
have  appeared  since  the  first  documented  experimental  Investigation  of  part  versus  whole 
learning  appeared  In  1900,  an  accompanying  framework  has  not  emerged  that  identifies 
key  experimental  dimensions  to  explain  conflicting  results,  or  a  valid  and  reliable  set  of 
training  principles  or  guidelines  (see  McGeocb,  1931;  McGeoch  &  Irion,  1952;  Naylor, 
1962;  Stammers,  1982;  and  Wlghtman  &  Llntern,  1985).  Recently  Stammers  (1982) 
tested  the  generality  or  the  prevailing  principles. 

Stammers  sought  to  test  a  set  of  principles  proposed  by  Naylor  and  his  co-workers 
(e  g.,  Blum  &.  Naylor,  1968,  Naylor,  1962,  and  Naylor  &  Briggs,  1963)  and  Annett  and 
Kay  (1956).  The  so-called  Naylor  hypothesis  Is  embodied  In  two  principles:  l)  as 
complexity  Is  Increased  for  relatively  highly  organized  tasks,  training  the  whole  task 
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should  work  better  than  training  parts  of  the  task;  and  2)  as  complexity  Is  Increased  for 
relatively  less  organized  tasks,  training  parts  should  become  more  efficient  than  training 
the  whole  task.  Annett  and  Kay's  principle  suggests  that  the  more  Independent  a  task's 
parts,  the  more  it  should  be  learned  as  a  whole;  the  more  Interdependent  a  task's  parts, 
the  more  Its  parts  should  be  separated  for  training.  Stammers  conducted  four 
experiments  to  evaluate  the  generality  of  these  principles  by  evaluating  part  and  whole 
learning  In  a  procedural  control  panel  task  and  a  list  learning  task  made  up  of 
operational  Instructions. 

On  the  whole.  Stammers  found  little  support  for  the  notion  that  practicing  parts  of 
a  task  produces  advantages  over  practicing  the  whole  task.  Differences  between  training 
groups  were  small,  with  whole  training  gaining  a  slight  advantage.  Nevertheless,  he 
cautions  that  deciding  to  use  one  form  of  training  over  another  should  be  made  on  the 
basis  of  empirical  observations,  rather  than  some  analytic  principles.  Although 
Stammers  has  helped  to  Illuminate  a  literature  filled  with  conflicting  results,  It  should  be 
noted  that  neither  the  studies  reviewed,  nor  his  own  experiments  adequately  address  the 
issue  of  task  interdependence  and  complexity  as  It  pertains  to  the  relative  efficacy  of 
part-task  practice  for  concurrent  tasks. 

Several  researchers  examining  part-whole  learning  In  dual-task  situations 
emphasize  whole-task  practice.  Aji  alternative  to  decomposing  a  complex  task  Into  Its 
component  parts  and  practicing  them  separately  exists  In  the  form  of  adaptive  training. 
In  adaptive  training  the  task  Is  first  simplified  and  IS  then  made  progressively  more 
difficult  as  the  learner  acquires  greater  levels  of  expertise  (see  Johnson  &  Haygood,  1984: 
Kelly,  1969;  and  Llntern  &  Gopher.  1978).  Typically  the  learner  Is  exposed  to  the  whole 
task  or  almost  the  whole  task  to  be  mastered.  However,  differential  pay-off  conditions 
emphasize  some  tasks  over  others.  In  this  way,  each  component  Is  practiced  In  the 
context  of  the  whole  task.  Johnson  and  Haygood  (1984)  have  found  that  progressively 
challenging  the  learner  In  a  primary  simulated  driving  task  (tracking)  and  a  secondary 
visual  detection  task  resulted  In  better  performance  than  training  In  the  single-task 
conditions. 

In  general,  the  part-whole  training  literature  does  not  provide  a  strong  support  for 
either  part  or  whole  training.  Nor  does  It  identify  critical  variables  that  can  predict 
when  and  how  much  part-task  training  should  precede  multi-task  training.  In  general, 
most  complex,  real-world  skills,  e  g.,  piloting,  driving,  and  programming,  are  initially 
developed  via  part-task  training  until  at  least  some  basic  level  of  proficiency  Is  reached 
before  multi-task  training  Is  begun. 

Frameworks  for  single-  to  multi- task  transfer 

.Although  psychology  can  offer  a  variety  of  theoretical  frameworks  for  multi-task 
performance,  there  are  no  models  that  predict  single-  to  multi-task  transfer.  Theories  of 
multi-task  performance  generally  Involve  switching,  e.g.,  Broadbent  (1958),  or  allocation 
of  some  limited  resource,  e  g.,  Kahneman  (1973).  Recent  models  have  elaborated  the 
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nature  of  resource  sharing,  e.g.,  Navon  &  Gopher  (1085),  or  attempted  to  stratify  the 
types  of  resources,  e.g.,  Wlckens  (1980,  1084b).  These  frameworks  have  generally  not 
been  applied  to  practice  effects.  From  switching  theories  one  might  assume  that  practice 
speeds  up  the  switching  rate  and  facilitates  learning  what  orders  and  rates  of  switching 
are  most  effective.  From  resource  theories  one  might  assume  practice  facilitates  learning 
what  proportions  and  types  of  resources  to  allocate  to  a  task.  In  dual-task  training,  an 
Individual  can  learn  what  combination  of  resource  allocation  produces  optimum 
performance.  For  example,  If  a  memory  task  can  be  performed  spatially  or  verbally, 
training  with  a  concurrent  spatial  task  will  encourage  the  subject  to  perform  the  memory 
task  using  verbal  coding.  Wlckens  (1084a)  comments  that  practice  can  Increase  the 
resource  efficiency,  so  that  the  same  task  can  be  accomplished  with  progressively  less 
resources.  The  mechanism  for  this  practice  effect  has  not  been  specified. 

Recent  frameworks  that  predict  learning  effects  in  single-task  experiments  suggest 
that  single-task  training  should  transfer  to  dual  tasks.  Pew  (1074),  for  example, 
extended  previous  models  of  tracking  and  suggested  that  humans  develop  higher-order 
control  mechanisms.  In  tracking  tasks,  novices  continuously  monitor  the  feedback  at  a 
low  level  of  control,  e.g.,  monitoring  momentary  error  between  actual  and  Intended 
position  of  the  object  being  controlled.  With  practice  attention  moves  to  higher  levels, 
e  g.,  monitoring  the  drift  between  the  control  equations  for  the  desired  and  Intended 
positions.  Shlffrln  and  Schneider  (1077)  extended  a  long  line  of  argument  ~  from  at  least 
James  (1890)  --  that  practicing  consistent  task  components  develops  automatic 
components  that  require  little  If  any  attentlonal  resources.  Anderson  (1083)  suggested 
that  practice  compiled  productions,  l  e.,  If-then  rules,  produce  automatic  component 
skills.  Norman  and  Shalllce  (1985)  proposed  that  practice  reduces  Interference  by 
allowing  processes  to  function  without  attention.  Hunt  and  Lansman  (1088)  presented  a 
computer  simulation  model  incorporating  production  systems  and  direct  activation 
functions.  In  this  model,  the  system  begins  executing  ge*  -ral  productions  by  comparing 
the  Input  to  the  desired  Input.  On  a  match,  the  appropriate  response  production  Is 
released.  With  practice,  direct  associations  develop  between  Input  and  output,  allowing 
tin  Input  to  evoke  the  output  without  requiring  the  attention-based  comparison.  Hunt 
and  Lansman 's  model  accounts  for  some  dual-task  procedures  as  a  result  of  time-sharing 
a  limited  executive  control  system. 

None  of  the  present  frameworks  directly  address  the  Issue  of  transfer  from  single  to 
multiple  tasks.  To  understand  Issues  related  to  transfer,  much  more  detailed  models  are 
required.  Even  the  limited  data  on  transfer  are  ambiguous.  Models  are  needed  that  can 
speciry  boundary  conditions  and  predict  the  appropriateness  and  marginal  utility  of 
various  types  of  practice  to  multi-task  performance.  The  following  model  Is  an  Initial 
step  of  elaborating  a  model  to  make  such  predictions. 

Overview:  A  Connectionist/Control  Architecture 

The  proposed  connections  control  architecture  provides  a  runnable  simulation 
model  for  representing  a  class  of  models  of  dual-task  performance  and  workload  (see 
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Schneider  &  Mumme,  1987;  Schneider  &  Detweller,  1987).  We  first  provide  a  brief 
overview  of  the  connections  control  architecture.  Connections  models  (see  Schneider. 
1987)  represent  a  radical  departure  from  energy-based  metaphors,  e  g.,  Kahneman 
(1973).  We  recommend  readers  new  to  this  type  of  modeling  carefully  work  through  the 
diagrams  and  examples  In  the  text.  After  the  overview,  we  will  Illustrate  how  skill  Is 
acquired  within  this  model  In  both  single-  and  dual-task  processing. 

This  architecture  Incorporates  a  variety  of  processing  elements  that  provide 
mechanisms  for  accomplishing  stable  Information  processing  of  real-world  tasks  in  an 
architecture  that  is  neurally  feasible  (see  Schneider  &  Detweller,  1987).  The  model 
Implements  a  five-phase  account  of  skill  acquisition  In  which  skill  development  Is 
characterized  as  gradual  and  continuous.  Five  assumptions  underly  and  constrain  this 
architecture.  First,  we  assume  that  Information  Is  processed  In  networks  of  neural-Uke 
units.  These  units  are  organized  Into  modules,  and  sets  of  modules  are  organized  Into 
levels  and  regions.  The  various  regions,  e.g.,  auditory,  speech,  lexical,  etc.  are  connected 
to  and  communicate  with  one  another.  Second,  we  assume  the  modules  exhibit  local 
specialization  of  function,  l.e.,  processing  only  a  restricted  class  of  inputs.  Third,  we 
assume  knowledge  ts  stored  In  the  connection  weights  among  the  neural-Uke  units.  As 
skill  Is  acquired  due  to  learning,  changes  In  knowledge  are  reflected  In  changes  In  the 
connection  strengths  among  units  and/or  by  the  size  of  their  weights.  Fourth,  we 
assume  the  connection  weights  operate  under  the  Influence  of  a  variety  of  learning  rate 
constants.  These  constants  determine  how  quickly  the  connections  change  as  a  result  of 
Intervening  learning  and  the  duration  of  the  retention  interval.  And  firth,  we  assume  a 
control  processing  system  modulates  the  transmission  of  Information  within  and  between 
processing  regions.  This  architecture  Is  a  variation  of  the  CAPi  system  designed  to 
model  automatic  and  controlled  processing  (see  Schneider  &  Shlffrln,  1977;  Schneider  & 
Mumme.  1987;  and  Shlffrln  &  Schneider,  1977). 

The  connectlonlst/control  architecture  can  be  described  at  three  levels  of  scale. 
The  microlevel  structure  Is  the  first  level  and  represents  a  network  of  neural-Uke  units 
that  account  for  associative  information  processing  and  a  range  of  attentlonal 
phenomena  (see  Figure  l).  A  message  vector  Is  an  output  from  one  module  and  an  Input 
to  another  module,  transferring  information  between  modules.  A  vector  Is  a  set  of 
activities  of  the  units  within  a  module.  For  example,  the  letter  •A*  might  be  coded  as  a 
message  vector  0,1, 1.1,1,  where  the  0's  and  i  s  represent  the  absence  and  presence  of 
features,  eg.,  vertical  lines,  horizontal  lines,  backward  slant  and  forward  slant.  The 
message  vector  Is  the  set  of  activation  values.  Information  flowing  from  a  module  Is 
regulated  by  an  attenuation  unit  within  each  module  (see  Figure  l;  Schneider  &  Mumme, 
1987;  and  Schneider  &  Detweller,  1987  for  details).  Attention  Is  a  scaler  multiplication 
of  the  activity  of  all  of  the  units,  eg.,  if  half  attending  to  the  letter  •A*,  the  output 
would  be  0,. 5. .5, .5, .5, ,5.  The  macrolevel  structure  Is  the  second  level  of  scale  and 
represents  Interactions  among  a  set  of  modules  (see  Figure  2B).  The  modules  are 
organized  as  levels  and  regions  of  processing.  The  levels  correspond  to  successive 
processing  stages.  For  example,  the  visual  module  Is  assumed  to  consist  of  a  series  of 
stages  that  Include  units  such  as  features,  characters,  and  words.  The  system-level 
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structure  is  tbe  third  level  of  scale  and  represents  the  Interactions  among  processing 
regions  made  up  of  modules  (see  Figure  2).  Each  region  consists  of  a  series  of  levels  of 
modules  and  their  respective  control  structures.  There  are  regions  specializing  In  input, 
e.g.,  visual,  auditory  ect.,  output,  e.g.,  motor,  speech,  etc.,  and  associative  processing, 
e  g.  semantic,  spatial,  context,  etc.  The  Innermost  levels  of  each  region  communicate, 
l.e.,  pass  vector  messages,  to  other  regions. 

Insert  Figure  1  about  here 


Ail  of  tbe  regions  communication  with  each  other.  Regions  communicate  on  an 
Innerloop  of  associative  connections  (see  Figure  2).  Each  module  Is  assumed  to  have 
associative  connections  to  the  other  modules  on  the  Innerloop.1  These  connections  allow 
each  module  to  send  message  vectors  to  other  modules.  There  are  separate  connections 
to  each  receiving  module.  These  connections  can  transform  one  output  transmission  to 
different  messages  In  different  regions  In  parallel,  e.g.,  a  bright  flash  may  transmit  from 
tbe  visual  system  and  assoctatlvely  evoke  a  startle  response  In  a  motor  module,  a  fear 
response  In  a  mood  module,  and  an  attempt  to  retrieve  related  Information  in  a  semantic 
module.  The  fact  that  each  module  on  the  Innerloop  has  Its  own  connections  also 
enables  multiple  modules  to  transmit  messages  simultaneously,  e.g.,  tbe  visual  module 
may  transmit  to  tbe  motor  module  while  the  auditory  module  transmits  to  the  speech 
module. 


Insert  Figure  2  about  here 


Note,  parallel  transmission  on  tbe  Innerloop  does  not  necessarily  imply  parallel 
processing.  All  the  message  vectors  coming  Into  a  module  are  summed  (see  Figure  1). 
When  two  messages  are  added,  inter-message  Interference  results.  This  la  analogous  to  a 
cocktail  party  situation.  Every  speaker  can  speak  at  loud  volume  for  extended  periods  or 
time  —  there  Is  no  energy  limit  on  output.  However,  each  listener  receives  the 
summation  of  all  spoken  messages.  Parallel  transmissions  may  result  In  high  Interference 
such  that  no  message  Is  received  (see  below).  In  such  cases  more  information  Is  conveyed 
If  Individuals  speak  sequentially,  even  though  each  has  tbe  power  to  speak  In  parallel. 
Within  the  proposed  architecture,  tbe  control  processing  Is  tbe  mechanism  by  which 
CAPl  moderates  message  transmissions  on  the  Innerloop. 

There  are  two  categories  of  information  flow  In  the  system;  these  are  message  and 
control  Information  flow.  Message  flow  Involves  the  transmission  of  a  vector 
representing  a  code  from  one  module  to  another,  e.g.,  the  visual  module  sending  a  vector 


1Thls  Is  a  simplifying  assumption  In  reasonable  speculation  based  on  neuroanatomy  (Mishkin  & 
Appenzeller.  1887).  Most  of  the  arguments  made  in  this  paper  will  gpply  to  lattice  architectures  that 
Involve  substantial  message  convergence  between  messages  transmitting  on  the  Innerloop. 
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coding  the  features  of  the  letter  “A*  to  the  semantic  module.  Control  flow  involves 
exchange  of  control  information  between  the  modules  and  a  control  structure  (see  Figure 
2B).  Control  Information  codes  the  Importance  of  messages  waiting  to  transmit  and  the 
transmission  state  of  any  of  the  modules,  e.g.,  signals  Indicating  bow  active  a  module  Is. 
how  Important  a  message  to  be  sent  Is,  or  how  much  to  attenuate  the  message.  At  the 
macro-  and  system-levels,  control  structures  receive  control  information  to  moderate 
Information  flow,  e  g.,  sequencing  transmissions  to  reduce  interference.  At  the  system 
level  the  control  flow  processing  Involves  a  central  control  structure  (see  Figure  2).  This 
structure  receives  activity  reports  from  all  regions.  These  reports  are  a  single  number 
from  each  module  which  Indicates  the  importance  of  the  message  In  the  module  awaiting 
transmission.  This  is  analogous  to  each  module  raising  Its  hand,  the  more  critical  the 
message  the  higher  the  band.  The  control  structure  then  ranks  the  requests  for 
transmission  and  allows  the  module  with  the  highest  priority  request  to  transmit.  A 
three-neuron  per  module  circuit  (shown  in  the  bottom  of  Figure  l)  Is  sufficient  to 
accomplish  this  control  function. 

Control  processing  Is  critical  for  performing  novel  tasks.  The  connectlonlst/control 
architecture  can  perform  a  novel  task  following  verbal  Instruction  (see  Schneider  & 
Mumme,  1087).  This  Is  accomplished  by  loading  vectors  Into  modules,  comparing  Input 
vectors  to  vectors  held  In  working  memory,  and  releasing  output  vectors  (see  below). 
This  processing  Is  slow,  serial,  and  effortful,  l.e.,  It  requires  many  shifts  of  which  modules 
are  allowed  to  transmit  (see  below).  The  control  Information  necessary  to  limit  message 
interference  also  euables  execution  of  verbal  rules.  With  each  execution  of  a  verbal  rule, 
associative  connections  between  the  modules  change  such  that  the  Input  will  evoke  the 
output  without  moderation  by  control  processing.  This  transition  (see  below)  Is  the 
mechanism  through  which  processes  become  automatic. 

In  this  architecture,  messages  need  not  all  pass  through  a  central  executive;  rather, 
regions  can  communicate  directly  with  other  regions.  Since  the  transmission  of 
concurrent  messages  may  cause  Interference,  the  central  control  structure  may  moderate 
transmission  among  regions.  Sequencing  messages  In  this  way  has  the  potential  of 
causing  delays  or  omissions.  The  model  predicts  many  of  the  Item  and  order  loss 
phenomena  witnessed  under  conditions  of  high  workload. 

A  context  storage  mechanism  enables  the  system  to  model  the  stability  of  human 
processing  and  to  mimic  a  variety  of  learning  phenomena,  e  g.,  episodic  memory  and 
remindings  (see  Schneider  &  Detweller,  1087).  This  context  mechanism  acts  to  associate 
the  contents  of  the  messages  on  the  lnnerloop  to  the  temporal  context  In  which  they 
occur.  The  context  vector  has  separate  connections  to  all  the  other  modules  on  the 
lnnerloop  (see  Figure  2),  One  context  vector  can  evoke  a  different  message  vector  on 
every  module  on  the  lnnerloop.  When  Information  vectors  within  modules  decay  or  are 
displaced,  modules  can  be  reloaded  by  retransmitting  the  context  vector.  The  ability  to 
reload  Information  via  transmitting  the  context  vector  reduces  the  chances  of 
catastrophic  processing  failure.  Further,  when  the  system  Is  confronted  with  high  levels 
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of  workload,  context  can  temporarily  store  associations  to  vectors  and  enable  the 
reloading  of  a  low  priority  task  delayed  during  the  processing  of  high  priority  tasks. 

Phases  of  skill  acquisition 

The  connectlonlst/control  architecture  has  been  Implemented  as  a  computer 
simulation  called  Controlled/Automatic  Processing  Model  l  (CAPi)  (see  Schneider  & 
Mumme,  1987;  Schneider  &  Detweller,  1987).  The  simulation  can  perform  single  and 
dual  tasks  comparable  to  typical  laboratory  dual  tasks.  We  will  describe  the  model  and 
the  phases  It  exhibits  as  It  learns  to  perform  single  and  dual  tasks.  It  important  that  the 
reader  try  to  conceptualize  the  operations  of  the  model2.  Most  of  the  Implications  follow 
from  the  conceptual  architecture  of  the  model.  The  details  of  the  simulation  provide  an 
existence  proof  that  this  type  of  architecture  will  produce  the  phenomena  discussed. 

The  model  Is  a  connectionist  simulation  model;  details,  equations  and  rationale  for 
the  architecture  can  be  found  In  Schneider  &  Mumme  (1987)  or  Schneider  &  Detweller 
(1987).  The  model  simulates  the  behavior  of  populations  of  neuraMlke  units.  It 
incorporates  the  associative  and  autossoclative  models  of  J.  A.  Anderson's  (1983)  brain* 
state-ln-a-box  model.  The  major  components  are  Illustrated  In  Figure  l.  Each  module  Is 
made  up  of  a  200-element  vector  of  output  units.  Output  units  are  neuron-Uke  units  In 
which  the  pattern  of  active  units  code  the  response.  Information  is  coded  as  a  vector  of 
-i  s  (Inactive  units)  and  +  l's  (active  units).  Each  output  unit  sums  Its  Input  linearly 
with  a  decayed  value  of  the  activity  of  the  previous  Iteration.  The  output  of  each  unit  Is 
a  logistic  function  of  the  input  with  a  minimum  activation  or  -1.3  and  maximum 
activation  or  +1.3.  e.g.,  connecting  within  the  set.  Each  output  unit  decays  to  some 
proportion  (typically  .9)  of  Its  activity  on  the  previous  cycle.  Each  output  unit  connects 
autoassoclatlvely  to  half  the  other  output  units  of  the  module.  Autoassoclatlve  feedback 
varies  from  0  for  receiving  a  new  vector  to  .6  to  latch  a  received  vector.3  Each  output 
unit  connects  to  half  of  the  units  of  other  modules  to  which  It  is  connected.  The 
autoassoclatlve  and  associative  connection  matrices  are  each  made  up  of  20,000 
connections  per  module.4  Ail  associations  were  Initialized  at  zero  strength. 

Two  types  of  learning  were  accomplished  using  a  modified  Hebb-type  learning  rule, 
the  delta  or  Wldrow-Hoff  learning  rule  (see  J.  A.  Anderson,  1983).  The  first  type, 
associative  learning,  modifies  connections  between  modules.  This  Involves  changing  the 
connection  weights  between  the  input  and  output  units  such  that  the  Input  comes  to 
elicit  the  output.  The  second  type,  priority  learning,  occurs  within  a  module.  This 


2 

Walter  Schneider  has  produced  an  animation  of  the  model  that  runs  on  an  CBM  PC  with  a  color 
monitor.  It  provides  a  dynamic  two-dimensional  representation  of  the  system.  Illustrating  the  stages  of 
skill  acquisition  and  the  problems  and  compensatory  activities  to  enable  dual-task  performance. 

3 

A  feedback  strength  of  1.0  Is  one  In  which  the  feedback  signal  Is  as  strong  as  the  Input  signal. 

4These  results  generalize  to  larger  matrices  typical  of  cortical  hypercolumns  (Mountcastle,  1979). 
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involves  changing  the  autoassoclatlve  weights  between  the  Input  unit  and  the  priority 
unit,  stimuli  that  are  consistently  attended  strongly  activate  the  priority  unit  which 
results  In  the  vector  being  transmitted  from  the  module.  Stimuli  that  are  not  attended, 
weakly  activate  the  priority  unit  and  Inhibit  the  vector  from  being  transmitted  unless 
control  processing  enables  transmission  of  the  vector.  Attention  Is  Implemented  by  the 
strength  of  the  attenuation  unit  (see  Figure  l).  The  attenuation  unit  is  multiplied  by  the 
strength  of  the  output  unit  between  o  (unattended  output)  to  l  (fully  attended  output). 
Controlled  processing  Involves  modulating  attenuation  and  monitoring  the  activity  of  the 
modules  (see  below).  Automatic  processing  Involves  activating  the  priority  unit 
sufficiently  to  reliably  transmit  the  vector  to  the  next  stage  of  processing  In  the  absence 
of  controlled  processing  Input. 

Development  of  automatic  processing  In  a  single  task 

The  CAPi  simulation  has  been  used  to  model  single-task  skill  acquisition.  Details 
of  the  modeling  can  be  found  in  Schneider  and  Mumme  (1987).  In  this  paper  we  present 
only  an  overview  of  single-task  learning  and  focus  on  dual-task  performance. 

CAPl  acquires  skills  through  five  phases.  The  movement  between  these  phases  is  a 
gradual,  continuous  transition.  The  rate  of  movement  between  stages  depends  on  the 
task  to  be  learned.  We  will  illustrate  the  transitions  using  numbers  based  on  subjective 
impressions  from  work  in  search  paradigms  (Schneider  &  Fisk,  1984).  and  learning  logic 
gates  from  electronic  troubleshooting  (Carlson  &  Schneider,  1987).  The  phases  are: 

1.  Controlled  comparison  from  buffered  memory  (Trials  1-4) 

2.  Context-maintained  controlled  comparison  (Trials  5-20) 

3.  Goal-state-maintained  controlled  comparison  (Trials  21-100) 

4.  Controlled-asslst  of  automatic  processing  (Trials  101-200) 

5  Automatic  processing,  (after  Trials  200  per  component  task). 

The  first  three  phases  involve  extensive  attentional  processing.  In  Phases  1-3,  the 
subject  serially  compares  stimulus  Information  with  Information  In  memory.  Once  the 
skill  has  shifted  to  Phase  4,  there  Is  a  substantial  reduction  In  attentional  processing  and 
a  qualitative  change  in  the  processing. 

Phases  4  and  5  Involve  direct  associative  retrieval  of  output  patterns  from  Input 
patterns  at  a  series  of  processing  levels.  We  will  Illustrate  single-task  learning  using  a 
category  search  task  patterned  after  Fisk  and  Schneider  (1983).  In  this  task  the  subject 
must  remember  one  or  more  categories  and  respond  to  a  visually  presented  word  by 
Indicating  whether  the  word  Is  a  member  of  the  remembered  categories.  Subjects'  initial 
performance  is  slow,  e.g.,  200  ms  per  comparison,  serial,  e.g.,  the  "no*  responses  are 
twice  as  slow  as  the  •yes*  responses,  and  effortful,  e.g.,  subjects  perform  poorly  under  a 
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secondary-task  load.  This  processing  characterizes  controlled  processing.  In  a  consistent 
mapping  (CM)  search  task  subjects  make  the  same  response  to  the  same  stimulus  over 
trials.  Practice  in  consistent  search  results  In  processing  that  is  fast,  e.g.,  2  ms  per 
comparison,  parallel,  e.g.,  equal  •yes/no*  slopes,  and  low  in  effort,  e.g.,  subjects  can 
perform  a  secondary  task  (see  Fisk  &  Schneider,  1883).  This  processing  characterizes 
automatic  processing.  In  contrast  to  consistent  practice,  practice  In  a  varied  mapping 
(VM)  task,  where  the  categories  searched  for  on  one  trial  become  distractors  on  the  next, 
shows  little  performance  change.  After  ten  hours  of  VM  practice,  performance  was 
about  as  slow,  serial  and  effortful  at  the  end  of  training  as  in  the  beginning  (e.g.,  Fisk  & 
Schneider,  1983). 

Phase  1.  "  Controlled  comparison  from  buffered  memory  Involves  loading  and 
maintaining  memory  vectors  In  modules,  comparing  comparison  vectors  to  the  input 
vectors,  and  releasing  the  appropriate  response  vector.  CAPl  performs  a  controlled 
comparison  by  adding  two  vectors  to  determine  their  similarity.  Vector  addition  entails 
that  each  output  unit  (Figure  l)  adds  the  Inputs  It  receives.  The  report  cell  receives  the 
square  or  the  sum  of  the  absolute  value  of  all  of  the  output  units.*  To  perform  a 
comparison,  two  vectors  are  added  together.  If  the  two  vectors  are  similar,  e.g.,  "cat" 
and  ■animal* ,  the  added  vector  Is  nearly  twice  as  long,  since  both  vectors  are  pointing  In 
the  same  direction.  In  contrast.  If  the  two  vectors  are  dissimilar,  e.g.,  'car*  and 
■animal",  the  vector  elements  add  orthogonally,  pointing  at  right  angles.  This  produces 
a  shorter  vector,  e.g.,  of  length  1.41  for  vectors  with  correlation  of  0.  The  dirrerence  In 
length  provides  a  criterion  for  match,  e.g.,  a  length  greater  than  1.85  (correlation  =  .5) 
Is  defined  as  a  match.  Processing  Is  serial  because  adding  triplets  of  vectors  produces  a 
more  error-prone  comparison  process  (see  Schneider  &  Mumme,  1987). 

Phase  l  processing  Is  very  effortful  in  the  sense  that  It  requires  many  shifts  of 
attention  and  monitoring  the  received  activity  of  vectors.  For  example,  to  perform  a 
two-category  search  task  with  a  display  size  of  two.  the  subject  must  actively  maintain 
the  two  category  vectors  and  two  response  vectors  (•yes*  and  *no*  responses),  shift 
attention  four  times  (2  visual  stimuli  x  2  categories),  compare  the  length  of  the  added 
vectors  to  the  criterion  length  four  times,  shift  attention  to  the  output  module,  and 
release  a  response  vector.  Phase  1  performance  Is  very  error  prone.  If  the  subject  Is 
Interrupted,  the  vectors  In  the  buffers  will  decay,  causing  errors.  If  the  attention¬ 
switching  operations  are  disturbed,  e.g.,  by  a  secondary  task,  the  comparisons  cannot  be 
made,  resulting  In  response  delays  or  omissions. 


Insert  Figure  3  about  here 


5The  equation  Tor  the  addition  of  the  vector  Is  SQRT(X2  +  Y2  +  2r  X  Y),  where  X  and  Y  are  the 

A  J 

vectors,  and  r  Is  the  correlation  of  the  two  vectors, 
x  y 
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Phase  2  ~  Coptext-malntalped  controlled  comparison  Is  similar  to  Phase  l,  except 
information  Is  maintained  in  fast  learning  weights  that  associate  vectors  stored  In 
modules  to  the  context.  Activating  the  context  module  can  refresh  Information  in 
modules  (see  Schneider  &  Detweiler,  1087).®  The  connections  between  the  context 
module  and  the  modules  on  the  lnnerloop  are  assumed  to  be  fast  learning  weights  (see 
Figure  2).  These  connections  can  quickly  associate  the  context  to  the  current  contents  of 
modules  on  the  lnnerloop.  After  a  small  number  of  trials,  e.g.,  four  repetitions,  the 
context  can  evoke  the  category  and  response  vectors.  This  context  storage  mechanism 
allows  the  system  to  re-actlvate  the  vectors  when  new  visual  stimuli  are  presented.  After 
four  trials  of  searching  for  the  same  categories,  subjects  can  perform  a  distractor  task, 
e  g.,  counting  backwards  by  3  s  between  trials,  without  performance  being  disrupted. 
Klapp.  Marshburn.  &  Lester  (1083)  have  demonstrated  that  if  subjects  rehearsed  the 
memory  set  for  five  seconds  in  a  memory  scanning  experiment,  performing  a  digit-span 
task  between  the  presentation  of  the  memory  set  and  the  probe  display  did  not  disrupt 
performance. 

Phase  2  processing  Is  effortful  and  requires  attention,  but  It  will  not  be  as  seriously 
disrupted  by  Interpolated  tasks.  Performing  a  category  search  task  requires  as  many 
shifts  of  attention  and  comparisons  as  In  Phase  l.  However,  the  context  can  now  be  used 
to  activate  the  vectors  In  the  buffers.  Note,  the  context  storage  only  works  for  runs  of 
the  same  task  and  Is  of  little  value  If  what  the  subject  Is  searching  for  changes  from  trial 
to  trial.  Fast  connection  learning  weights  show  very  serious  proactive  interference 
effects  (see  Schneider  &.  Detweiler.  1987)  and  provide  little  information  about  previous 
associations  If  new  associations  are  made  to  the  same  context.  During  Phase  2, 
processing  is  very  reliable  for  runs  of  the  same  comparison  set,  but  reverts  back  to  Phase 
1  with  every  change  In  the  set  to  be  searched  for. 

Phase  3  ~  Goal-state-malntalned  controlled  comparison  Is  similar  to  Phase  2  except 
the  goal  state  can  reload  the  modules  In  addition  to  the  context-based  reloading.  For 
example,  assume  the  subject  has  to  learn  three  rules  A,  B,  C.  In  Phase  2,  If  the  subject 
performs  a  series  of  A  trials,  the  A  vectors  become  associated  to  the  context  and  can  be 
reloaded  if  the  information  decays  from  the  buffer.  However,  on  the  first  trial  with  the 
B  rule,  the  subject  must  be  told  what  vectors  to  load  In  the  buffers.  This  loading 
associates  the  B  vector  to  the  context,  making  the  A  vectors  less  available.  In  Phase  3, 
the  subject  learns  to  associate  appropriate  vectors  to  multiple  goal  states  A,  B,  and  C, 
Instead  of  a  single  context  vector.  When  the  task  changes,  the  subject  needs  only  a  short 
time  to  remind  blm/herself  about  the  rule  to  reload  the  buffers.  The  subject  then 
performs  the  same  attentlonal  operations  as  in  Phase  l. 


6Thls  context  learning  mechanism  Is  Important  for  robust  processing  and  enables  the  model  to  account 
for  a  wide  range  of  working  memory  phenomena,  e.g.,  episodic  and  semantic  memory,  retroactive  and 
proactive  Interference,  elaboratlve  rehearsal,  and  release  from  proactive  Interference  effects;  It  Is  consistent 
with  physiological  data  suggesting  the  existence  of  a  two-speed  learning  system  (see  Mishkin,  Malamut.  ft 
Bachevaller.  1984). 
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Phase  3  processing  requires  attention  and  Involves  the  slow,  serial,  effortful  form  of 
processing  characterized  In  Phases  l  and  2.  It  is  somewhat  more  reliable,  In  that 
performing  an  Interpolated  task  In  the  same  modules  will  not  disrupt  performance.  The 
difficulty  subjects  have  going  from  practicing  a  single  rule  to  multiple  rules  provides 
evidence  for  the  presence  of  Phase  3  processing.  Carlson  and  Schneider  (1987)  found 
subjects  could  perform  single  digital  logic  gate  Judgement  tasks,  e.g..  the  output  of  an 
AND  gate  Is  high  If  all  Inputs  are  high,  in  .7  seconds  If  the  rules  were  massed  after  about 
5  trials.  However,  over  1000  trials  per  gate  were  needed  before  they  could  perform  trials 
of  mixed  gate  types,  e.g.,  AND,  OR,  NOR,  etc.  at  that  speed. 

Phase  4  ~  Controlled  assist  of  automatic  processing  produces  a  dramatic  reduction 
in  processing  time  and  effort  relative  to  the  previous  three  phases  (see  Figure  3).  During 
Phases  1-3  the  subject  repeatedly  compared  the  Input  to  the  vectors  In  the  buffers.  The 
match  processes  involved  four  comparisons.  A  match  was  followed  by  a  yes  response,  a 
non-match  by  a  negative  response.  In  Phase  4,  associative  learning  alters  the 
connections  between  the  input  and  output  modules  such  that  the  input  evokes  the 
output  (see  Schneider  &  Mumme,  1987),  e.g..  if  the  stimulus  •cat*  is  transmitted,  the 
learned  associative  responses  evoke  the  “index  finger*  response.  The  resulting 
associative  process  eliminates  the  need  for  controlled  comparison.  If  a  target  vector  Is 
transmitted  from  the  input  it  will  assoclatively  evoke  the  appropriate  response  In  the 
output.  Attention  is  still  required  to  transmit  the  vector  from  the  input  module  to  the 
output  module.  However,  attention  switching  of  the  comparison  vector  and  monitoring 
are  no  longer  necessary.  Processing  Is  parallel  In  memory  In  the  sense  that  the  input  will 
evoke  its  output.  Independent  of  the  number  of  Input/output  pairs  learned  by  the 
connection  matrix.7 

Phase  4  requires  a  small  amount  of  attention  to  transmit  the  Input  vectors  on  the 
Innerloop  to  the  output  modules.  If  there  are  multiple  Input  channels,  attention  would 
still  switch  between  the  Inputs.  For  example,  In  a  category  search  task  with  two  words 
being  presented,  i.e.,  display  size  two  as  In  Fisk  &  Schneider  (1983),  the  subject  attends 
to  the  first  word  transmitting  the  vector.  If  the  association  evokes  a  "yes*  response, 
then  the  response  is  made,  if  not,  the  second  word  is  attended  and  transmits  the  second 
vector.  If  the  vector  evokes  a  positive  response,  then  a  response  is  made;  If  not,  after 
attending  to  the  last  stimulus  input,  a  negative  response  Is  made.  Based  on  previous 
research  (Fisk  &  Schneider,  1983)  a  category  comparison  requires  about  200  ms  and  an 
associative  retrieval  probably  100  ms.8  Based  on  these  numbers  we  can  compare  the 
time  attention  Is  required  to  perform  a  two-word  to  two-category  comparison.  During 


7There  are  capacity  limits  to  associative  networks,  (see  Mumme  &  Schnelder.1087),  however.  Until 
there  ere  about  as  many  associations  as  half  the  number  of  connections  In  the  matrix,  associative  retrieval 
can  be  considered  Independent  for  uncorrelated  Input  vectors. 

Q 

The  latter  Is  more  difficult  to  estimate  since  It  Is  based  on  Intercept  effects.  Treat  this  number  as  a 
rough  estimate  for  purposes  of  Illustration.  We  speculate  the  minimal  transmission  time  Is  In  the  range  of 
30  to  100  msec  for  well-learned,  highly  discriminate  stimuli. 
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Phases  1-3,  four  comparisons  and  one  attended  response  are  made,  requiring  1.0  s  (4  x 
0.2  s  comparisons,  0.2  s  response  output).  During  Phase  4,  two  attention  switches  are 
made  In  0.2  seconds  (2  x  0.1s  visual)  and  one  assoclatively  evoked  response  transmission 
(0.1  s  motor).  When  the  subject  has  developed  a  reliable  level  of  proficiency  In  Phase  4, 
there  Is  a  70%  reduction  (1  s  vs.  .3  s)  in  the  amount  of  attention  and  an  88%  reduction 
(.8  s  vs.  .1  s)  in  transmission  time  on  the  lnnerloop.  At  this  stage  substantial  attention 
and  lnnerloop  transmission  time  are  available  to  perform  other  tasks. 

Phase  5  "  Automatic  processing  occurs  when  automatic  processing  substitutes  for 
attentlonal  processing.  During  Phases  1-4,  various  vectors  were  attended  In  order  to 
transmit  them  to  later  modules.  A  message  that  was  transmitted  prior  to  a  positive 
event,  eg.,  the  visual  input  'CAT*  associated  with  a  "yes*  response,  would  be 
associated  within  the  module  with  a  high  priority  tag.  In  contrast,  messages  that  are 
transmitted  without  follow-on  events,  e.g.,  the  word  *CAR*  which  Is  never  responded  to 
In  the  experiment,  would  have  a  low  priority  tag.  Automatic  processing  occurs  when  a 
message  associated  with  a  high  priority  event  Is  transmitted  in  the  absence  of  attentive 
Input.  This  occurs  when  the  local  circuit  of  the  priority  tag  Inhibits  the  attenuation 
units  transmitting  the  message  (see  Figure  i,  box  AP).  Automatic  processing  can  occur 
through  a  series  of  stages.  Each  stage  Involves  an  association  of  the  output  of  the 
previous  vector  which  evokes  a  new  vector  within  the  module.  Then,  that  vector  Is 
categorized  via  autoassoclatlve  interactions  and  evokes  the  priority  tag.  If  the  tag  Is 
high  enough,  the  priority  tag  causes  the  vector  to  be  transmitted  out  of  the  current 
module  to  the  next  module.  This  process  then  cascades  through  a  series  of  stages. 

Phase  5  processing  requires  no  attention,  however,  the  automatic  transmission  of 
messages  does  require  some  transmission  time  on  the  lnnerloop.  In  a  search  paradigm, 
priority  learning  eliminates  the  need  for  attentlonal  switching.  Assume  the  words 
"CAT*  and  "CAR*  are  presented,  with  "CAT*  being  a  previous  target  and  *CAR*  a 
previous  dlstractor.  The  *CAT*  message  will  evoke  a  high  priority  tag  and  be 
transmitted,  whereas  the  *CAR"  module  will  evoke  a  low  priority  tag  and  not  be 
transmitted.  Transmitting  the  *CAT*  vector  will  evoke  the  positive  response  in  the 
motor  module.  The  motor  response  will  evoke  a  high  priority  tag  and  transmit  Its 
message  and  cause  the  *yes*  response  to  be  made.  The  priority  tag-based  filtering  In 
Phase  4  eliminates  the  need  to  serially  transmit  visual  vectors  as  In  Phase  4. 

Table  1  contrasts  the  resource  demands  of  performing  the  search  task  as  a  function 
of  phase  of  processing. 
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Table  1  Resource  demands  as  a  fraction  of  the  phase  of  skill  acquisition 


Phase 

Attention 

Time 

Attention 

Switches 

Comparisons 

Innerloop 

Time 

l  Controlled  Processing  (CP) 

1.0  s 

5 

4 

0.8  s 

2  Context  maintained  CP 

1.0  s 

5 

4 

0.8  s 

3  Goal  state  maintained  CP 

1.0  s 

5 

4 

0.8  s 

4  Controlled  assist  AP 

0.3  s 

2 

0 

0.2  s 

5  Automatic  processing  (AP)  0.0  s  0  0 

Problems  of  multi-task  performance 

0.1  S 

In  many  applied  contexts. 

humans  must 

perform 

multiple  tasks 

concurrently 

Performance  typically  deteriorates  sharply  when  subjects  are  asked  to  perform 
concurrent  tasks.  The  connectlonlst/control  processing  architecture  provides  an 
Interpretation  of  the  difficulties  of  dual-task  performance  and  the  Improvements  that 
occur  with  practice.  Having  a  subject  perform  multiple  tasks  will  have  different 
consequences  depending  on  what  phase  of  practice  the  subject  has  reached  before  dual¬ 
task  processing  occurs. 

Having  a  subject  perform  multiple  tasks  during  Phase  l  will  show  severe  disruption 
In  performance.  For  example,  In  a  memory  scanning  experiment,  If  the  subject  briefly 
sees  a  memory  set  (too  briefly  to  allow  rehearsal)  and  then  performs  a  dlstractor  task, 
performance  Is  very  error  prone.  The  errors  result  because  the  buffer  codes  can  not  be 
maintained  and  the  comparison  time  requires  considerable  controlled  processing  and 
Innerloop  resources  (see  Table  l).  Performing  a  secondary  task  must  be  multiplexed 
with  the  attentional  processing  required  by  the  primary  task,  e.g.,  1.0  s  for  the  4- 
comparlson  category  search  task.  This  attentional  processing  Is  required  during  Phases 
1-3. 


Phase  2  processing  can  be  performed  after  an  Interruption  from  an  Irrelevant 
secondary  task.  In  a  search  task,  after  the  subject  rehearses  the  words  several  times  or 
performs  several  trials  with  the  same  memory  set,  the  memory  set  vectors  are  associated 
to  the  current  context.  If  an  Irrelevant  task  Is  performed  that  requires  attention,  the 
memory  set  vectors  will  decay.  By  re-evoklng  the  context,  the  memory  set  vectors  can 
be  reloaded  and  the  memory  comparison  process  restarted.  However,  If  the  subject 
performs  the  search  task  with  a  new  memory  set,  the  new  set  will  be  associated  to  the 
current  context  and  make  the  previous  memory  set  unavailable.  Therefore,  the  subject 
will  not  be  able  to  tlmeshare  a  process  that  requires  context  storage  of  information  In  the 
same  buffers. 

Phase  3  processing  can  be  performed  after  an  Interruption  from  a  relevant 
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secondary  task.  Once  the  coal  state  can  evoke  the  vectors  for  comparison,  the  context 
storage  Is  no  longer  necessary.  For  example,  If  the  subject  is  looking  for  words  from 
categories  A,  B,  or  C,  In  Phase  3,  presenting  the  category  labels  Is  sufficient  to  allow  the 
appropriate  vectors  to  be  evoked.  The  actual  comparison  still  requires  substantial 
attentlonal  processing,  e.g.,  1.0  s  for  the  category  search. 

Phase  4  processing  requires  little  attentlonal  processing.  It  can  be  timeshared  with 
other  tasks.  The  other  tasks  must  not  require  the  same  modules  as  the  Phase  4  task, 
and  provide  short  periods  of  time  when  attention  can  be  allocated  to  the  Phase  4  task. 
In  the  category  search  example,  the  Phase  4  process  required  .3  s  of  attentlonal 
processing  relative  to  l.o  s  for  Phases  1-3. 

Phase  5  processing  requires  only  a  short  period  of  innerloop  transmission  time  and 
the  modules  necessary  to  process  the  Input  and  output.  Phase  5  processing  can  occur 
concurrently  with  other  tasks.  The  other  tasks  must  not  completely  monopolize  the 
Innerloop  transmission  capacity.  In  the  category  search  example,  the  Phase  5  process 
requires  .1  seconds  of  Innerloop  transmission  time.  Assuming  one  set  of  stimuli  Is 
presented  at  a  rate  of  one  per  second,  the  category  task  would  require  the  visual  modules 
to  process  the  word,  .Is  of  Innerloop  transmission  time  to  transmit  the  visual  vector  to 
the  motor  module,  and  the  motor  modules  to  output  the  word.  A  secondary  task  could 
utilize  all  other  modules,  all  of  the  attentlonal  control  processing,  and  00%  (  9s  of  l  s)  of 
the  Innerloop  transmission  time.  In  this  way  two  tasks  utilizing  different  Input/output 
modules  with  modest  interloop  transmission,  e.g.,  reading  while  taking  dictation,  could 
be  processed  concurrently  at  single-task  performance  levels. 

Phase  5  processing  Is  still  limited.  If  two  modules  transmit  on  the  Innerloop  at  the 
same  time,  there  will  be  a  loss  of  Information.  In  the  model,  when  multiple  messages  are 
transmitted,  the  received  message  Is  the  addition  of  the  transmitted  messages.  For 
example.  In  a  task  In  which  the  subject  responds  to  a  word  with  a  button  press  and  to  a 
tone  with  a  spoken  word,  the  motor  module  might  receive  both  the  tone  and  the  visual 
features  of  a  word.  If  the  messages  are  of  equal  strength  and  evoke  Incompatible  outputs 
In  the  receiving  modules,  no  message  will  be  received  (see  below). 

Compensatory  activities  and  training  in  dual  task 

Given  the  above  changes  In  performance,  why  does  single-task  training  show  such 
limited  transfer  to  dual  task  performance?  A  simple  view  of  automatlclty  would  predict 
that  once  a  task  Is  automatic,  It  should  be  able  to  be  combined  with  other  tasks  without 
deficit.9  If  one  accepts  this  view,  one  should  train  all  the  single  tasks  and  spend 
relatively  little  time  in  dual-task  training.  However,  the  above  literature  review  shows 
this  prescription  to  be  wrong,  e.g.,  Schneider  &  Fisk  (1984)  found  nearly  novice-level 
dual-task  performance  after  8  hours  of  single-task  training. 


9 


A  view  sometimes  Inappropriately  ascribed  to  the  first  author. 
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This  section  focuses  on  what  happens  differently  In  dual-task  training  than  In 
single-task  training,  and  describes  how  the  system  compensates  In  the  dual  task  to 
facilitate  performance.  In  general,  performance  In  a  dual-task  situation  puts  a  premium 
on  the  use  of  attention  and  innerloop  transmissions  that  are  not  critical  In  the  single-task 
situation.  For  example.  In  single-task  category  search,  a  Phase  4  task  requires  .3  seconds 
of  attention  and  .2  seconds  of  innerloop  transmission  time.  Performance  Is  fast, 
accurate,  and  Involves  little  effort,  and  there  ts  little  need  to  change  strategy.  In 
contrast,  In  a  dual-task  experiment  even  the  .3  second  attention  load  may  be 
unacceptably  high. 

In  the  dual-task  situation  an  entirely  new  set  of  behaviors  must  be  executed  that 
are  not  required  In  the  single-task  situation.  To  Illustrate  this,  consider  a  category/tone 
dual  task.  The  category  task  requires  responding  to  visually-presented  animal  words  by 
pressing  a  button  with  the  right  hand.  The  tone  task  requires  saying  ‘target*  whenever 
a  high  tone  Is  presented.  Assume  the  category  and  tone  tasks  are  at  Phase  4  levels  of 
skill.  In  single-task  category  search,  the  subject  need  only  switch  attention  between  the 
Input  channels,  monitor  the  motor  channel,  and  release  the  response.  In  the  dual  task, 
the  subject  must  additionally  switch  between  the  visual  and  auditory  regions.  If  the 
regions  transmit  at  different  time  scales,  transmission  durations  must  be  changed.  The 
system  must  switch  what  Is  being  monitored,  l.e  ,  during  visual  transmission  the  motor 
system  Is  monitored  and  during  auditory  transmission  the  speech  module  Is  monitored. 
If  different  criteria  are  needed  for  received  messages,  e  g.,  the  motor  association  Is 
stronger  than  the  speech  association,  these  must  be  switched  between  tasks.  If  a  module 
Is  loaded  Inappropriately,  the  module  must  be  cleared  to  load  the  message  from  the 
appropriate  message,  e.g.,  in  the  category/tone  task,  If  the  speech  system  loads  a 
message  during  the  transmission  of  the  visual  word,  that  message  must  be  cleared  before 
the  message  associated  to  the  tone  can  be  loaded.  The  system  must  also  block  the 
Interference  effect  of  the  second  message  transmission,  e.g.,  the  motor  module  must  not 
be  cleared  by  the  transmission  of  the  tone  message,  the  motor  message  must  output  to 
other  levels  In  the  motor  region  while  the  tone  message  is  being  transmitted. 

The  types  of  compensatory  activities  that  are  effective  In  multi-task  situations 
depend  on  what  phase  of  skill  the  subject  Is  In  on  each  of  the  tasks.  If  two  tasks  are  In 
Phase  l,  dual-task  performance  Is  very  erroneous  except  for  very  simple  tasks.  Errors 
occur  because  the  subject  Is  unable  to  maintain  information  about  the  second  task  while 
performing  the  first  task.  If  the  subject  Is  In  Phase  2  or  3,  the  two  tasks  can  be  done  by 
performing  each  task  separately  with  little  overlap.  The  subject  can  use  context  and 
goal-state  Information  to  reactivate  the  vectors  of  the  second  task  after  performing  the 
first  task.  In  Phase  4  the  subject  must  switch  attentlonal  processing  between  the  tasks, 
but  the  Input  vectors  will  evoke  the  appropriate  outputs  without  having  to  reload  or 
compare  vectors.  In  Phase  5  the  subject  must  modify  the  Interloop  transmissions  to 
eliminate  Interference  among  messages. 

There  are  seven  behaviors  that  the  current  architecture  Illustrates  In  dual-task 
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situations  tb&t  either  do  not  occur  Id.  or  are  aot  as  critical  for  single-task  situations. 
These  all  Involve  decreasing  the  load  on  limited  atteDtional  resources  and  Innerloop 
transmissions. 

1.  Task  shedding,  delay,  and  pre*loadtng 

When  two  tasks  must  be  performed  concurrently,  the  easiest  compensatory 
activities  are  to  either:  l)  not  perform  one  of  the  tasks,  or  2)  delay  the  lower  priority 
task.  Not  performing  tasks  eliminates  the  cognitive  load  of  those  tasks.  Training  In  a 
dual-task  enables  the  learner  to  realize  which  tasks  can  be  deleted  and  how  much 
cognitive  capacity  is  made  available  to  other  competing  tasks.  Experience  In  performing 
multiple  tasks  enables  one  to  anticipate  and  monitor  the  consequences  of  delaying  or 
eliminating  a  task.  For  example,  under  conditions  of  high  workload,  experience  guides 
the  performer  in  knowing  when  to  delay  a  task  or  to  delete  It  altogether  in  order  to 
attend  to  a  higher  priority  task  component. 

To  manage  workload,  it  would  be  best  to  even  out  potential  workload  peaks,  either 
by  delaying  or  pre-executing  procedures.  Delay  Involves  buffering  the  Input  from  one 
task  while  performing  the  more  critical  task.  In  the  present  connectionist/control 
architecture,  every  module  is  a  buffer  that  can  maintain  Information  Tor  short  periods  of 
time  (see  Schneider  &  Detweller,  1987).  If  stimuli  from  two  tasks  are  presented 
simultaneously,  the  lower  priority  stimulus  can  be  buffered  while  the  higher  priority  task 
Is  completed,  then  the  lower  stimulus  can  be  processed.  If  the  two  tasks  require  the 
same  modules,  e  g.,  foveal  vision,  the  first  task  can  be  performed  first  and  then  the 
second  task. 

Pre-loading  involves  preprocessing  information  prior  to  the  onset  of  the  critical 
workload  segment.  If  vectors  can  be  preioaded  and  maintained  io  modules,  these 
activities  need  not  be  executed  while  the  stimuli  are  being  processed.  For  example,  the 
context  storage  system  can  activate  vectors  In  many  modules  simultaneously.  If  an 
operator  reviews  the  various  tasks  before  the  critical  task  segment  begins,  attending  to 
the  context  can  evoke  vectors  In  multiple  modules  simultaneously.10  Training  combat 
helicopter  pilots  illustrates  this  technique.  Pilots  are  encouraged  to  verbally  rehearse 
possible  actions  before  a  *pop  up*  maneuver  exposing  themselves  to  enemy  fire  -  a  high 
workload  situation.  In  the  present  architecture  this  rehearsal  associates  task-relevant 
vectors  to  the  current  context.  The  context  can  then  maintain  the  vectors  so  they  do 
not  decay  when  they  are  not  rehearsed.  Preloading  eliminates  the  need  to  recall  each 
procedure  once  the  task  has  begun. 


io 


Under  the  restriction  that  each  task  operates  In  separate  modules. 
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2.  Letting  go  of  unnecessary  high  workload  strategies 

Subjects'  strategies  In  single-task  conditions  may  entail  a  greater  workload  than 
necessary.  The  five  phases  of  skill  acquisition  described  above  represent  CAPi's  learning 
stages.  Certain  levels  of  practice  are  required  before  later  phases  become  reliable. 
However,  a  subject's  strategy  may  Influence  how  long  s/he  continues  a  high  workload 
strategy  after  the  lower  workload  strategy  Is  active.  In  our  letter  search  experiments  (see 
Schneider,  1685)  a  proportion  of  our  subjects  continued  to  exhibit  serial  search,  l.e., 
reaction  times  that  Increase  linearly  with  the  number  of  comparisons  long  after  the 
majority  of  our  subjects  exhibited  parallel  search,  l.e.,  non-linear  or  flat-slope  functions. 
When  these  subjects  were  pressured  to  respond  faster,  we  often  saw  a  dramatic  break  In 
the  slope  function.  This  break  suggests  they  shifted  from  a  Phase  3  to  a  Phase  4 
strategy. 

Training  under  high  workload,  e  g.,  In  multi-task  situations,  encourages  subjects  to 
adopt  low  workload  strategies.  In  a  single-task  situation,  a  high  workload  strategy,  e.g., 
using  Phase  3  when  Phase  4  or  5  are  reliable,  results  In  somewhat  slower  and  more 
effortful  processing  than  necessary.  A  high  workload  strategy  may  also  be  somewhat 
more  accurate  than  a  lower  workload  strategy.11  In  the  single-task  situation  subjects 
may  persist  in  using  a  high  workload  strategy.  In  the  multi-task  situation,  using  a  high 
workload  strategy  on  one  task  may  produce  very  poor  performance  on  another  task  - 
due  to  depriving  the  second  task  of  resources.  In  order  to  boost  total  performance  the 
learner  may  try  to  alternate  strategies  between  trials.  The  learner  may  realize  the  low 
workload  strategies  are  reliable,  e.g.,  learning  to  read  provides  an  Illustration  of  this  (see 
Laberge  &  Samuels.  1674).  Students  can  voice  each  word  as  they  read  It  and  become 
very  accurate  at  sight  reading.  However,  If  a  reader  uses  this  high  workload  strategy  to 
decode  the  words,  s  he  Is  slow  and  cannot  allocate  sufficient  resources  to  adequately 
comprehend  the  text.  In  a  speeded  reading  task  with  a  comprehension  test,  the  reader  Is 
more  ilkely  to  "let  go*  of  the  high  workload  word  decoding  strategy.  Once  the  learner 
begins  practicing  the  later  phases  of  the  skill,  these  phases  can  become  reliable  and 
enable  availability  of  lower  workload  strategies. 

For  the  CAPi  model,  experience  during  later  phases  of  skill  acquisition  results  In 
belter  transfer  to  high  workload  performance  than  during  earlier  phases.  For  example, 
practice  during  Phases  1-3  produced  associations  between  the  added  vectors,  l.e.,  those 
used  in  the  comparison  operation  and  responses.  In  the  category  search  task,  the  added 
semantic  vectors  of  the  visual  word  and  the  lexical  category  were  associated  to  the  motor 
response,  e.g.,  the  vector  sum  of  *CAT*  +  'ANIMAL*  are  associated  to  the  *yes* 
response.  During  Phase  4  performance,  the  word  vector  Is  transmitted  without  the 
category  vector,  e.g.,  the  vector  *CAT*  alone  without  being  summed  with  'ANIMAL*. 
Phase  S  performance  matches  Phase  4  performance  In  that  the  word  Is  transmitted  as  In 
Phase  4,  but  not  the  category,  as  In  Phase  3.  Associations  built  up  during  Phase  4  are  of 


11  With  modest  levels  of  practice,  e  g.,  less  than  100  trials  per  component.  Phase  i  Is  less  reliable  than 
Phase  3. 
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the  input  stimuli.  Transfer  from  Phase  3  to  4  will  be  depend  on  the  degree  of  overlap 
between  the  combined  and  Individual-vectors  (see  Schneider  &  Mumme,  1987).  Once 
Phase  4  Is  reliable,  practice  using  a  Phase  4  strategy  will  produce  better  transfer  to 
Phase  5  than  practice  using  a  Phase  3  strategy. 

3.  Utilising  noncompeting  resources 

During  multl-taslc  training,  the  subject  can  learn  to  allocate  task  components  to 
minimize  resource  competition.  Single-task  practice  facilitates  developing  a  strategy  that 
Is  fast  and  accurate.  Multi-task  training  facilitates  developing  the  optimal  combined 
strategies  for  performing  all  of  the  tasks.  For  example,  In  an  air  Intercept  control  task, 
one  can  determine  the  trajectories  of  two  aircraft  In  order  for  them  to  Intersect  using 
either  quantitative  or  spatial  procedures.  In  single-task  training  subjects  will  execute  the 
strategy  stressed  by  the  Instructor  and  that  produces  fast  and  accurate  responding. 
However.  If  the  learner  practices  the  Intercept  task  while  performing  a  concurrent  task 
requiring  spatial  operations,  e  g.,  navigation,  or  quantitative  operations,  e.g.,  calculating 
flying  time,  the  optimal  strategy  depends  on  the  context. 

Within  the  proposed  connectlonlst/control  architecture  multiple  resources  can  be 
Invoked  to  accomplish  tasks  (see  Schneider  &  Detweller,  1987).  Baddeley,  Grant,  Wight, 
and  Thomson  (1974)  provide  an  empirical  demonstration  of  utilizing  noncompeting 
resources  to  perform  concurrent  dual  tasks.  For  example,  digits  In  a  spatial  task  can  be 
stored  either  spatially,  e  g.,  as  a  visual  image  of  a  grid,  or  verbally,  e.g.,  as  the 
proposition  "5  left  of  8."  When  subjects  were  required  to  perform  a  concurrent  tracking 
task,  performance  was  better  when  they  used  a  verbal  code  to  store  the  digits  (Baddeley, 
et  al..  1974). 

Wlckens  (1984)  has  proposed  that  human  factors  designers  should  allocate  tasks  to 
modalities  to  minimize  resource  competition.  Early  statements  of  this  view,  e/\, 
Wlckens  (1980).  suggested  that  modalities  had  different  resource  pools,  and  hence 
putting  two  tasks  Into  different  modalities  would  be  advantageous.  Recent  reviews  have 
shown  that  dividing  tasks  between  modalities  sometimes  Improves  and  sometimes 
deteriorates  performance  (Wlckens,  Fracker,  &  Webb,  1988  for  this  conference).  In  the 
CAPl  architecture,  no  simple  prescription  of  dividing  tasks  among  modalities  Is 
applicable  over  a  wide  range  of  tasks  or  practice  levels. 

In  the  present  architecture,  placing  tasks  In  different  regions,  e.g.,  modalities,  Is 
sometimes  advantageous  and  sometimes  disadvantageous.  Placing  tasks  In  different 
regions  has  the  benefit  that  there  Is  no  competition  between  multiple  vectors  In  the  same 
modules,  e.g..  letters  can  be  stored  In  a  visual  module  and  tones  In  an  auditory  module 
with  little  interference.  However,  It  may  be  more  difficult  to  switch  attention  between 
modalities  than  within  a  modality.  Remember  that  Phase  1-3  level  skills  require  a  great 
deal  of  attention  switching,  whereas  Phase  4  requires  little,  and  Phase  5  none.  This 
suggests  that  dividing  tasks  between  modalities  may  produce  Inferior  performance  Tor 
modestly  practiced  tasks,  e.g.,  under  200  trials  per  stimulus  In  a  search  task,  while  at  the 
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same  time  producing  superior  performance  for  cons  Jtent,  well-practiced  taalcs  which  do 
not  require  attention  switching. 

4.  Time-multiplexing  skills 

Training  In  the  multi-task  situation  provides  experience  In  learning  to  time- 
multiplex  the  transmissions  on  tbe  innerloop  to  accomplish  the  combined  tasks.  If  two 
modules  transmit  messages  at  the  same  time,  the  potential  of  Intermessage  Interference 
exists.  For  example,  In  the  dual  category/tone  task.  If  both  the  visual  and  tone  vectors 
transmit  at  the  same  time,  the  receiving  modules  will  receive  the  summed  vector,  e.g., 
the  sum  of  the  semantic  vector  of  "CAT*  and  'HIGH  TONE*.  These  vectors  are  likely 
to  be  unrelated  and  tbe  interference  will  result  in  both  messages  being  blocked  (see 
Figure  4A).12  If  the  messages  are  multiplexed,  l.e.,  transmit  'CAT*  then  'HIGH 
TOiVE*  (see  Figure  4B),  both  messages  can  be  received  accurately  after  a  small  delay  for 
the  second  message. 


Insert  Figure  4  about  here 


Multi-task  training  provides  the  learner  opportunities  to  exercise  different 
multiplexing  schemes.  Given  that  two  tasks,  A  and  B,  must  be  accomplished,  there  are 
many  multiplexing  schemes.  Should  A  be  completed  before  B  Is  started?  Should  all  the 
Inputs  be  multiplexed  before  the  outputs  are  multiplexed?  Should  A  be  multiplexed  at  a 
higher  rate,  e  g.,  process  task  A  twice  as  often  as  task  B?  Senders  (1983)  and  Moray 
(1984.  1986)  have  shown  that  after  extended  training,  human  operators  learn  to  sample 
instrument  gages  at  the  optimal  rate,  based  on  the  relative  Information  rate  of  each 
channel.  The  allocation  of  Internal  control  processing  may  be  tuned  through  experience 
In  a  manner  comparable  to  the  way  the  operators  allocate  attention  between  gages. 

5.  Shortening  transmissions 

In  multi-task  situations  Innerloop  transmission  time  is  at  a  premium.  Learning  to 
transmit  messages  with  shorter  transmissions  enables  more  transmissions  per  unit  time 
and  hence  better  multi-task  performance.  In  a  single-task  condition  there  Is  no  benefit 
for  shorter  transmissions.  When  the  visual  module  transmits  to  tbe  motor  module,  a 
transmission  of  loo  or  500  ms  may  have  effectively  the  same  result,  When  the  motor 
module  receives  the  message  in  100  ms,  the  response  Is  begun.  If  the  visual  system 
transmits  for  an  additional  400  ms,  no  damage  Is  done.  Long  transmission  times  have 
the  potential  benefit  of  increasing  the  reliability  of  the  transmission  In  the  event  that 
some  messages  require  more  than  100  ms  to  complete.  Single-task  training  Is  likely  to 
lengthen  the  transmission  time  of  messages  transmitting  on  tbe  Innerloop. 


12 

The  degree  of  Inter-message  Interference  depends  on  the  relative  strength  of  the  vectors, 
orthogonality  of  the  vectors,  and  the  degree  to  which  the  receiving  module  has  learned  to  receive  each 
vector.  If  the  messages  are  of  equal  strength,  orthogonal,  and  equal  In  degree  of  learning,  neither  message 
will  be  received  reliably. 
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Multi-task  training  encourages  the  learner  to  shorten  transmissions  to  the  minimum 
length  sufficient  to  transmit  the  information  (see  Figure  4C).  This  Is  analogous  to  what 
Is  seen  In  air  traffic  control  tasks.  When  traffic  Is  light,  controllers'  radio 
communications  are  often  long,  social,  and  more  Is  communicated  than  Is  essential.  In 
contrast,  under  heavy  traffic  conditions,  communications  become  almost  telegraphic, 
relying  on  a  minimal  number  of  Jargon  phrases  to  convey  necessary  Information.  In  high 
workload  situations  short  communications  allow  more  tasks  to  communicate  critical 
Information  between  sender  and  receiver. 

in  the  simulation  model,  multi-task  training  enables  the  system  to  determine  the 
minimal  transmission  time  and  to  tune  the  receivers  to  reliably  detect  short  duration 
transmissions.  Under  the  pressure  of  high  workload,  the  operator  Is  likely  to  vary 
transmission  times  on  one  task  to  free  capacity  for  other  tasks.  The  algorithm  for 
varying  transmission  time  Is  simple  ~  If  the  transmission  was  successful,  reduce 
transmission  time  on  the  next  trial;  conversely,  If  transmission  was  unsuccessful.  Increase 
transmission  time.  The  net  result  Is  that  the  system  finds  the  minimum  transmission 
time  necessary  to  transmit  the  message  reliably.  A  secondary  benefit  of  practicing  with 
shorter  transmission  times  Is  that  short  transmissions  can  tune  the  receivers  to  categorize 
the  noisy  short  transmissions  Into  the  appropriate  message.  Reception  of  short,  noisy 
transmissions  becomes  more  reliable  If  the  receiving  module  Increases  the  degree  of 
feedback  and  the  amount  of  autoassociation  (see  Schneider  &  Mumme,  1087).  Practice 
under  high  workload  allows  the  system  to  Improve  performance  by  shortening 
transmission  times  and  strengthening  the  wlthln-module  associations  or  feedback.  In 
single-task  practice  these  changes  produce  negligible  performance  Improvement,  and  the 
system  may  asymptote  with  particularly  long  transmissions. 

6.  Converting  interference  from  concurrent  transmissions 

In  addition  to  avoiding  Interference  by  procedures  such  as  multiplexing, 
associations  can  be  modified  to  effectively  tune  out  specific  Interfering  messages.  Such  a 
tuning  effect  can  be  Illustrated  In  the  dual  category/tone  task  in  which  the  subject 
responds  to  animal  names  by  pushing  button  A  and  non-animal  names  by  pushing  button 
B.  The  tone  task  requires  the  subject  to  say  “target*  (response  C)  to  a  high  tone.  To 
Illustrate  Interference  effects,  assume  the  subject  also  learned  to  make  the  motor 
response  (push  button  D)  to  the  high  tone.  During  Initial  dual-task  training  the  conflict 
between  the  visually-evoked  response  (A  or  B)  and  the  tone-evoked  response  (D)  would 
produce  message  Interference  In  the  motor  module.  The  module  would  receive  the 
combined  message  (A+D)  which  Is  ambiguous.  This  Interference  necessitates 
multiplexing  the  messages  (see  point  4  above).  To  eliminate  Interference,  the  tone  Input 
would  have  to  either  elicit  no  response  In  the  motor  system,  or  elicit  a  response  that  does 
not  Interfere  with  the  A  or  B  responses. 

Multi-task  training  can  associate  an  Irrelevant  Input  to  the  relevant  outputs  such 
that  the  Irrelevant  Input  does  not  alter  the  outputs.  The  process  of  converting 
Interference  from  concurrent  transmissions  occurs  through  changes  In  the  association 
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matrices  between  modules.  The  process  Is  one  or  orthogonallzlng  the  vectors,  t.e.,  each 
connection  matrix  Is  altered  so  that  the  transmission  of  the  message  from  one  task  does 
not  bias  the  receiver  modules  of  the  other  task.  In  the  dual  category/tone  task, 
multiplexing  can  associate  the  Input  tone  with  the  motor  responses  for  the  visual  Input. 
On  a  dual-task  trial,  the  visual  stimulus  would  be  transmitted  and  evoke  the  appropriate 
motor  response,  e.g..  the  word  •CAT*  evoking  the  A  response.  Shortly  thereafter  the 
tone  would  be  transmitted  and  evoke  the  appropriate  response  In  the  speech  region,  e.g., 
if  high  tone  say  ’target”.  In  a  short  period,  the  motor  response  would  therefore  receive 
two  Input  messages  (word  and  tone)  and  make  one  response  (response  A).  Associative 
learning  would  cause  the  input  to  be  associated  to  the  output.  Thus  the  tone  association 
to  the  previous  motor  response  (response  D)  would  be  weakened,  and  the  association  to 
the  current  response  (response  A)  would  be  strengthened.  Similarly,  the  response 
between  the  word  and  the  speech  output  would  be  modified  by  the  word  determined  by 
the  tone.  On  other  trials,  the  tone  would  be  associated  to  the  motor  response  of  the 
responses  of  alternative  category  stimuli.  With  practice,  the  tone  would  no  longer  evoke 
Its  motor  response,  but  rather  evoke  a  response  made  up  of  the  average  of  the  two  motor 
responses  to  the  visual  Input.  After  dual-task  training,  the  motor  module  would  receive 
the  summed  code  of  A  for  an  animal  name  and  the  average  of  A+B  for  the  tone.  The 
net  input  (i.5A-i-  O.SB)  would  be  highly  correlated  to  the  A  response  and  would  generally 
be  categorized  as  the  A  response.  At  this  point  the  tone  stimulus  could  be  transmitted 
simultaneously  with  the  word  without  deficit,  thus  allowing  true  parallel  transmission  of 
both  stimuli  with  little  loss  (see  Figure  4D). 

Note  that  Interference  conversion  Is  specific  to  the  messages  trained,  and  all 
possible  combinations  of  the  messages  must  be  trained.  For  example,  to  learn  to 
associate  10  visual  stimuli  to  3  motor  responses  and  5  auditory  stimuli  to  2  speech 
responses  requires  learning  20  vlsual-to-speecb  patterns  (10  visual  stimuli  to  2  speech 
responses)  and  15  auditory- to-motor  responses  to  convert  the  Interference.  A  stimulus 
that  does  not  evoke  an  lnterferrlng  response,  e.g.,  the  word  ”blue”  normally  evokes  a 
key  press,  would  require  little  training  for  Interference  training.  However,  if  It  evokes  a 
strong  Interference,  e.g.,  the  visual  word  ”blue”  evoking  the  speech  output  of  the  word 
as  In  a  Stroop  (1935)  task,  extensive  training  will  be  required  to  convert  the  interference. 
If  new  stimuli  or  responses  are  Introduced,  these  will  also  have  to  be  trained. 
Interference  conversion  Is  specific  to  particular  messages  and  does  not  disconnect  regions. 
For  example,  practice  In  the  category/tone  task  may  orthogonallze  the  motor  output 
from  a  high  tone  message.  However,  such  practice  would  minimally  affect  the  motor 
association  of  dissimilar  auditory  Inputs,  e.g.,  respond  to  the  word  ’Jump”. 

The  specificity  of  Interference  effects  predicts  the  asymmetric  interference  effects 
often  observed  In  attention  and  multi-task  situations.  For  example,  In  Stroop  (1935) 
Interference  the  word  Interferes  with  naming  the  color  but  not  vlce-a-versa.  Since  one 
set  of  messages  Is  extensively  trained  relative  to  another,  e.g.,  naming  printed  words 
rather  than  naming  the  colors  of  printed  letters,  there  Is  an  asymmetric  interference 
effect.  Shaffer's  (1975)  report  on  dual-task  typing  also  Illustrates  this.  He  found 
concurrent  visual  typing  and  auditory  shadowing  were  done  with  little  cost,  whereas 
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concurrent  auditory  typing  and  visual  shadowing  produced  severe  Interference.  In  the 
present  model  one  would  expect  extensive  practice  at  copy  typing  with  some  concurrent 
comprehending  of  concurrent  messages  would  strengthen  visual  word-to-motor 
connections,  strengthen  audltory-tosemantlc  connections,  and  somewhat  weaken  the 
cross  connections.  These  changes  would  support  parallel  visual  typing  and  auditory 
transmission  while  resulting  in  very  poor  auditory  typing  and  visual  comprehension. 

7.  Chunking  transmissions 

If  the  system  can  transmit  chunks  or  compact  codes,  more  transmission  time  can  be 
available  for  other  tasks.  Consider  the  behaviors  of  a  copy  typist.  The  letter  pattern 
•THE*  might  be  transmitted  to  the  fingers  as  the  "TV  *HV  *EV  requiring  three 
transmissions  (see  Figure  5A).  In  contrast,  if  the  visual  system  can  transmit  a  combined 
code  of  •THE*,  only  a  single  transmission  Is  needed  (see  Figure  5C).  Transmitting 
syllable  chunks  would  reduce  lnnerloop  transmissions  by  about  66%.  With  a  chunk  size 
of  three,  three  tasks  can  be  accomplished  with  only  a  minor  delay  In  each  task. 

The  motor  system  could  then  decode  the  chunks  and  output  the  Individual 
messages.  Practice  In  the  single-task  condition  will  show  little  benefit  from  chunking 
unless  the  output  is  limited  by  the  transmission  time,  e.g.,  until  a  person  types  faster 
than  10  characters  per  second  there  Is  no  speed  advantage  in  developing  chunk  codes  for 
a  single  task.  In  contrast.  In  the  multi-task  situation  any  reduction  In  attention  or 
transmission  time  on  one  task  provides  more  attention  and  lnnerloop  transmission  time 
Tor  other  tasks. 

Developing  chunk  codes  can  occur  when  modules  transmit  In  parallel  from  input  to 
output.  In  the  simulation  model  (see  Schneider  &  Mumme,  1887),  the  system  modifies 
the  connections  between  the  input  and  output  modules  so  the  Input  comes  to  evoke  the 
output.  To  develop  proper  associations,  the  correct  output  must  be  In  the  output 
module  before  the  to-be-learned  Input  vector  Is  transmitted.  This  can  occur  If  all  the 
Input  modules  first  sequentially  transmit  each  message  Individually,  then  all  the  Input 
modules  transmit  simultaneously  before  the  output  Is  associated  to  the  input.  For 
example,  consider  transmitting  the  word  "THE*.  Initially,  if  all  three  messages  are 
transmitted  simultaneously  no  letter  Is  received,  due  to  inter-message  interference. 
However,  If  the  letters  are  transmitted  sequentially,  the  •TV  #H#  and  the  "E*  can  be 
evoked  In  the  output  region,  translated  to  the  appropriate  response,  and  buffered  (see 
Schneider  &  Detweller,  1987).  If  all  the  Input  modules  transmit  simultaneously,  followed 
by  the  transmission  of  the  output,  the  combined  Input  code  will  come  to  evoke  the 
combined  output  code. 


Insert  Figure  5  about  here 


A  chunk  can  be  transmitted  by  transmitting  all  the  elements  of  the  chunk  in 
parallel  (Figure  SB).  To  accomplish  parallel  transmission  of  the  chunk.  Interference 
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coDversion  must  occur  (see  above,  Figure  4 D)  such  that  the  message  transmitted  to  the 
first  module  does  not  Inappropriately  bias  the  second  module.  For  example,  in  the 
context  of  transmitting  “TH*  from  the  first  2  modules,  the  *E*  code  In  the  third  motor 
module  would  be  activated  strongly,  other  vowels  weakly,  and  Inhibit  most  consonants. 
Transmission  of  *TH*  would  not  Interfere  with  the  transmission  of  *E*  or  'A*  but 
would  interfere  with  the  transmission  of  *C*  in  the  third  position.  This  Is  the  same 
process  as  discussed  In  multi-task  training  In  which  the  visual  and  auditory  transmissions 
did  not  interfere.  In  chunking,  It  Is  Important  that  parallel  transmissions  from  modules 
In  the  same  region  (see  Figure  2B  level  3  visual  modules)  do  not  interfere  with 
concurrent  transmissions  from  the  same  region.  Developing  such  transmission  capacity 
requires  training  in  the  •multi-task*  situation  of  transmitting  the  codes  concurrently. 
Training  Is  specific  to  the  codes  practiced,  e.g.,  practice  at  sending  •THE*  will  result  in 
no  transfer  to  sending  'AND*. 

A  second  method  of  transmitting  a  chunk  Is  by  compacting  the  information  Into  a 
chunk  code  and  transmitting  the  chunk  code  from  a  single  module  (Figure  SC).  In  the 
current  model,  the  visual  region  can  transmit  either  letters  or  syllables  on  the  tnnerloop. 
In  acquiring  typing,  the  motor  system  would  first  learn  to  respond  to  Individual  letters, 
e.g.,  for  the  visual  transmission  of  a  *T*  move  the  left  Index  finger  to  the  •T,"  position 
and  press  the  key.  A  word  would  produce  a  series  of  letter  codes  In  the  second  level  of 
the  visual  region  and  a  series  of  movements  In  the  third  level  motor  region  (see  Figure 
2B).  With  training  the  last  level  of  the  visual  region  would  develop  a  combined  code  for 
the  visual  pattern  ■THE*  and  the  first  level  of  the  motor  region  would  develop  the 
combined  code  for  the  motor  task  of  outputting  'THE*  on  the  keyboard.13 

Developing  chunk  codes  has  an  important  advantage  over  parallel  transmission  In 
that  It  allows  for  greater  capacity  to  buffer  transmissions.  In  the  current  model, 
Schneider  &  Detweller  (1087)  assume  that  regions  can  maintain  vectors  In  only  a  small 
number  (three  to  four)  of  modules  for  a  given  level  of  processing.  A  typist  who  could 
visually  chunk  syllables  could  maintain  Information  for  12  letters  In  the  visual  system 
and  12  keys  In  the  motor  system.  A  typist  without  visual  chunking  could  only  maintain 
4.  By  chunking  the  encoding  and  decoding  one  can  operate  asynchronously.  The  eyes 
can  move  at  a  rate  determined  by  word  frequency,  and  the  fingers  by  the  typing 
configuration  of  the  keys  with  an  average  of  four  transmissions  per  every  12  characters. 
For  a  60  word  per  minute  typist  this  would  entail  ooly  2  transmissions  per  second. 
Delays  In  visual  transmission  would  have  no  impact  on  output  until  the  average  delay 
reduced  transmissions  below  2  per  second  or  long  delays  occurred  together,  such  that  all 
the  motor  buffers  would  be  output  before  the  next  visual  transmission  occurs.  In 
contrast,  transmitting  the  letters  In  parallel  (Figure  5B)  will  cause  delays  whenever  the 
encoding  time  for  any  4-letter  string  is  greater  than  the  typing  time  of  that  string  and 
will  require  holding  the  next  visual  input  until  the  current  motor  output  Is  completed. 


13 

This  could  be  accomplished  by  training  a  set  of  hidden  units  to  encode  all  the  features  of  the  previous 
layer  (see  Rumelhard,  Hinton.  &  Williams  1086). 
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This  is  because  in  parallel  transmission,  the  same  motor  modules  are  used  for  reception 
of  the  letter  and  output  of  the  letters.  Hence,  the  visual  system  cannot  transmit  the  next 
set  of  letters  until  the  first  set  of  letters  Is  completely  output.  If  chunk  transmissions 
occur,  the  motor  system  can  receive  the  second  chunk  while  the  first  chunk  is  being 
output  (see  Schneider  &  Detweller  1087). 

The  three  modes  of  outputting  chunks  shown  in  Figure  5  produce  dramatically 
different  output  rates  and  innerloop  loads.  Using  a  typing  example,  assuming  a  50  word 
per  minute  (wpm)  typist,  a  characters  per  word,  .2s  transmission  times,  1-5  letter  syllable 
chunks  (average  3  letters):  sequential  output  would  average  a  100%  innerloop  processing 
load  and  the  parallel  or  chunk  transmission  a  33%  load  (one  transmission  every  3 
letters).  If  one  assumed  that  the  encoding  and  decoding  occurred  at  one  character  per  .1 
seconds,  a  typist  would  have  an  average  typing  rate  of:  50  wpm  for  sequential 
transmission  (.2s  per  letter  transmission);  SO  wpm  for  parallel  transmission  (.2s  per 
transmission  and  3s  delay  for  the  output  3  characters  from  the  buffer):  and  100  wpm  for 
chunked  output  (  3s  to  concurrently  input,  transmit  and  output  a  3  letter  chunk). 

Research  Agenda 

The  present  theoretical  framework  provides  an  agenda  for  the  study  of  single-  to 
multi-task  transfer.  The  literature  review  at  the  beginning  or  this  article  Illustrates  that 
there  is  very  little  research  on  bow  single-task  training  transfers  to  multi-task  situations 
In  which  practice  levels  are  sufficient  to  develop  skilled  performance.  Learning  bow  to 
optimize  this  transfer  is  important  for  applied  questions  such  as  optimizing  simulator 
time  in  skill  acquisition  and  answering  basic  questions  such  as  bow  cognitive  processing 
changes  with  practice. 

Research  and  formal  modeling  of  single-  to  multi-task  transfer  is  still  in  its  infancy. 
To  understand  the  acquisition  of  high  performance  skills  will  require  collaboration  among 
the  applied  and  basic  research  communities.  Due  to  the  high  cost  of  training,  most  high- 
performance  skill  acquisition  occurs  In  applied  training  programs.  The  basic  research 
community  can  perform  laboratory  studies  tracking  the  phases  of  skill  development  and 
produce  quantitative  models  of  skill  acquisition.  The  following  research  issues  are  listed 
to  suggest  directions  for  future  research  that  seem  particularly  fruitful  within  a  research 
program  evaluating  skill  acquisition  within  the  proposed  connectlonlst/control 
architecture. 

Marginal  utility  of  single  to  multi-task  transfer 

The  present  model  predicts  there  Is  a  declining  benefit  of  single-task  training  for 
performance  in  multi-task  situations.  However,  there  is  a  benefit  for  single-task  training. 
In  order  for  associative  learning  and  priority  learning  to  occur,  the  learner  must  be  able 
to  maintain  the  necessary  vectors  in  memory  and  to  perform  the  task  accurately. 
Starting  a  learner  In  a  multi-task  situation  Is  likely  to  overload  processing  during  Phases 
1-3.  Hence  it  is  beneficial  to  Instruct  the  learner  on  each  component  task  individually. 
The  learner  should  then  perform  the  task  In  the  single-task  mode  until  performance  is 
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fast  and  accurate.  To  assure  that  at  least  a  goal-state-malntalned  skill  level  (Phase  3)  Is 
developed,  It  is  Important  that  subjects  be  able  to  perform  well,  even  when  required  to 
randomly  execute  the  behaviors.14  Failure  to  provide  this  single-task  training  can  hinder 
progress  because  the  component  associations  never  become  reliable. 

Single-task  training  has  a  reducing  marginal  utility.  After  a  certain  level  of  skill  Is 
reached,  continued  single-task  training  can  be  Inefficient  compared  to  multi-task 
training.  First,  multi-task  training  allows  the  subject  to  develop  the  seven  compensatory 
activities  described  above.  Including  task  sbeddtng/delay,  letting  go,  utilization  of  non¬ 
competing  resources,  time  multiplexing,  shortening  transmissions,  chunking,  and 
converting  Interference  from  concurrent  transmissions.  In  addition,  multi-task  training 
typically  provides  more  practice  on  both  tasks  for  the  same  total  practice  time,  allows 
components  to  become  integrated,  and  generally  Is  more  motivating  to  the  learner. 

Systematic  research  1$  needed  to  Identify  parameters  predicting  the  marginal  utility 
of  single-task  training  and  the  optimal  sequencing  of  component-  and  total-task  training 
to  maximize  training  effectiveness.  Task  complexity  variables  need  to  be  identified  to 
predict  how  many  trials  are  needed  to  transition  a  skill  through  various  phases.  More 
studies  comparing  one  part-task  training  scheme  to  one  whole-task  scheme  will  be  of 
little  benefit.  A  sufficient  set  of  such  studies  (see  above.  Stammers.  1982)  already  exists 
and  provides  an  ambiguous  picture.  The  CAPl  model  shows  that  training  can  be 
inefficient  If  either  too  little  or  too  much  part-task  training  Is  provided.  A  meta  analysis 
lumping  studies  with  varying  degrees  of  practice  together  Is  expected  to  show 
Inconclusive  results. 

To  develop  guidelines  to  Improve  training,  criterion  variables  must  be  Identified  to 
predict  when  to  shift  from  part  training  to  aggregate  training.  The  current  model 
suggests  one  should  move  from  part-  to  whole-task  training  once  the  Individual's 
component  skills  have  reached  at  least  the  controlled  assist  phase  (Phase  4)  of 
proficiency.  The  phase  of  skill  development  can  be  Independently  verified  using 
secondary  task  tests  to  see  how  well  the  skill  can  be  performed  under  workload  (see 
Schneider  &  Detweiler,  1986).  Studies  mapping  out  the  functional  relationships  between 
proportion  of  time  that  should  be  spent  In  part-  and  whole-task  training  for  a  variety  of 
tasks  and  training  times  would  be  particularly  useful. 

Examination  of  learning  multi-task  compensatory  activities 

The  present  architecture  suggests  seven  compensatory  activities  that  develop  In 
multi-task  situations  that  have  little  significance  in  single-task  situations.  For  the  most 
part,  these  compensatory  activities  have  not  been  extensively  studied.  For  example, 
what  is  the  functional  relationship  between  delaying  one  task  and  the  probability  of 
correctly  executing  the  task  without  restarting  the  task?  In  multiplexing,  bow  many 


14Phases  1-2  become  fast  and  accurate  as  long  as  the  same  task  Is  executed  repeatedly  after  Instruction 
on  the  component. 
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channels  can  the  central  control  structure  keep  track  of  before  some  requests  are  lost? 
How  sensitive  Is  transmission  reliability  to  the  duration  of  transmissions?  How  lone  does 
It  take  to  build  a  higher-level  chunk  code,  and  does  It  require  explicit  practice  aimed  at 
building  chunk  transmission  codes?  How  hard  is  It  to  switch  between  modules  within  a 
region  and  between  regions?  How  fast  and  how  fully  can  dual-task  training 
ortliogonallze  transmissions  to  reduce  intermessage  interference?  Can  individuals  sample 
Information  from  different  regions  at  different  rates?  Do  they  naturally  develop  different 
sampling  rates?  Do  individuals  discover,  on  their  own,  what  tasks  should  optimally  be 
shed  In  high  workload  situations?  Real  world  multi-task  performance  requires  the 
development  of  compensatory  activities.  Basic  research  and  training  guidelines  relating 
to  understanding  compensatory  activities  is  severely  needed. 

Role  of  part-task  trainers  for  multi-task  skills 

It  is  important  to  Identify  how  part-task  training  can  be  modified  to  Increase 
transfer  to  multi-task  performance.  In  many  complex  training  systems,  part-task 
training  can  be  much  cheaper  than  full-task  training.  For  example,  training  a  pilot  how 
to  operate  radio  gear  in  a  part-task  desk-top  computer  simulation  might  cost  only  one 
thousandth  as  much  as  training  the  skill  in  a  full*motton/v|sual  scene  flight  simulator. 
However,  if  the  single-task  training  does  not  transfer  to  the  multi-task  situation,  then  the 
part-task  training  is  of  little  benefit.  Since  most  of  the  compensatory  activities  for 
dealing  with  high  workload  are  not  present  in  single-task  situations,  training  a  single 
component  may  have  limited  utility. 

Training  single  ta«ks  under  high  workload  may  have  substantial  benefits  over 
single-task  training  for  transfer  to  multi-task  situations.  Or  the  seven  compensatory 
activities  described  above,  all  except  converting  Intermessage  Interference  develop  under 
most  high  workload  situations  and  do  not  require  the  exact  messages  to  be  sent  for 
compensatory  activities  to  develop.  For  example,  any  dual-task  situation  will  encourage 
the  learner  to  delay  tasks,  adopt  low-workload  strategies,  develop  time  multiplexing 
skills,  shorten  transmissions,  and  chunk  transmissions. 

Within  the  current  architecture,  training  under  high  workload  is  critical,  but 
training  in  the  full  task  may  not  be.  Let  us  Illustrate  this  by  assuming  the  goal  is  to 
teach  four  tasks  A.B.C.D.  If  the  tasks  are  trained  individually,  appropriate 
compensatory  skills  for  dealing  with  high  workload  will  not  develop.  Hence,  single-task 
training  should  be  limited  to  Insuring  that  the  Individual  tasks  be  fast  and  reliable. 
.Alter  the  Individual  tasks  are  trained,  dual  tasks  should  be  practiced  to  develop  the 
compensatory  activities,  e  g.,  train  A.B.C.D  then  AB.CD,  then  ABCD.  The  dual-task 
combinations  should  be  chosen  such  that  tasks  that  can  be  integrated,  or  need  to  convert 
the  Interference  of  other  tasks'  transmissions  (due  to  need  for  concurrent  transmission) 
are  practiced  together.  During  dual-task  practice  compensatory  activities  develop  that 
transfer  to  other  multi-task  situations.  For  example,  training  during  dual-task  AB  may 
develop  the  A  task  such  that  automatic  transmissions  occur  In  chunks  transmitted  In 
short  bursts  multiplexed  at  a  fixed  rate.  The  resulting  A  transmissions  produce  little 
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load  either  on  attentlonal  processing  or  on  innertoop  message  traffic.  Therefore,  the  A 
skill  developed  In  the  AB  training  condition  should  transfer  well  to  multitask  situations, 
e.g.,  AC,  AD,  ABC,  ABD,  and  ABCD. 

The  current  modeling  suggests  that  part-task  trainers  should  be  multi-task  trainers. 
We  speculate  that  most  of  the  training  time  required  to  develop  high  skill  levels  Involves 
practice  moving  the  skill  from  the  goal-state-maintained  phase  (Phase  3)  to  the 
automatic  processing  phase  (Phase  5).  To  accomplish  this,  secondary-task  loading  Is 
critical.  The  high  workload  can  be  produced  either  via  presenting  a  calibrated  workload 
task  or  by  concurrently  practicing  other  high  workload  tasks.  A  calibrated  workload 
task  might  be  a  varied  mapping  auci.ory  search  task  that  requires  considerable  attention 
and  lnnerloop  transmission  but  does  not  Improve  with  practice  (see  e.g.  Fisk,  Derrick  & 
Schneider,  1987).  Let  us  refer  to  this  as  task  X.  The  training  simulators  would  train  A, 
B.  C,  D.  AX.  BX,  CX,  DX.  then  ABCD.  Such  part-task  training  simulators  would  cost 
about  the  same  as  the  single-task  trainers  and  might  produce  substantially  more  transfer. 
Perhaps.  It  would  be  more  efficient  to  build  multi-task  trainers  so  the  learner  would  use 
the  practice  time  to  develop  skill  on  task-relevant  procedures.  The  training  simulators 
would  train  A.  B.  C,  D.  AB,  CD,  then  ABCD,  with  the  bulk  of  the  training  time  being  In 
the  AB  and  CD  dual-task  training.  It  is  likely  that  part-task  trainers  that  can  train 
multiple  tasks  win  be  far  less  expensive  to  produce  than  full-system  simulators.  Research 
Is  required  to  determine  the  effectiveness  of  such  trainers  and  to  develop  guidelines  for 
task  analysis  and  the  division  of  tasks  across  training  devices. 

Quantitative  modeling  of  skill  acquisition 

There  Is  a  critical  need  for  developing  and  testing  quantitative  models  of  skill 
acquisition  with  emphasis  on  multi-task  performance.  Most  previous  modeling  of  high 
workload  performance  has  been  at  too  coarse  a  level  of  analysis  to  have  had  a  strong 
Impact  on  the  training  process.  We  feel  that  general  resource  theories  (e.g.,  Kahneman, 
1973;  Wlckens,  1984;  Navon  &  Gopher,  1980)  have  neither  dealt  explicitly  with  practice, 
or  differentiated  the  resources  to  a  level  of  detail  to  suggest  guidelines  for  training.  An 
analogy  to  economics  Illustrates  our  concern.  Macro-economic  theory  at  the  level  of 
predicting  GNP  has  had  a  very  limited  success  In  predicting  economic  shifts  or  providing 
business  managers  data  to  make  production  decisions.  In  contrast,  linear  programming 
techniques  that  predict  production  costs  as  a  function  of  specific  resources,  e.g.,  the  cost 
of  Ice-cream  as  a  function  of  the  cost  of  sugar,  milk,  chocolate,  allow  managers  to 
evaluate  alternative  configurations  of  training  time  and  training  devices  to  build  better 
skills  for  a  fixed  total  cost  of  resources. 

Models  need  to  specify:  l)  What  types  and  quantities  of  resources  exist;  2)  How  the 
resources  are  utilized  to  accomplish  specific  tasks;  and  3)  How  resource  utilization 
changes  with  practice.  The  connectlonlst/control  architecture  Illustrates  the  beginning 
of  such  a  model.  The  resources  Involve  the  number  and  kinds  of  modules,  attention 
switching,  transmission  time  on  the  lnnerloop,  number  and  strength  of  connections,  etc. 
The  computer  simulation  can  perform  specific  tasks  such  as  visual  search  and  acquisition 
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or  simple  digital  troubleshooting  skill.  Resource  utilization  changes  dramatically  as  skill 
Is  acquired.  Initially  performance  (Phases  1-3)  Is  slow,  serial,  and  effortful,  e  g.,  very 
attention  demanding.  With  practice,  automattclty  develops  and  performance  becomes 
fast,  parallel,  and  requires  little  effort,  e.g.,  no  e’tentlon  and  little  innerloop  transmission 
time. 


Modeling  efforts  should  emphasize  cognitive  architectures  rather  than  single 
models.  A  cognitive  architecture  Identifies  a  space  of  models  rather  than  an  Individual 
model  (see  J,  R.  Anderson,  1983;  Laird,  Rosenbloom,  &  Newell.  1988).  The  present 
connectlonlst/control  architecture  defines  such  a  space  of  models.  Within  this 
architecture  there  may  be  many  possible  individual  models,  e.g.,  postulating  different 
connection  patterns  among  modules  on  the  Innerloop  provides  a  family  of  related  models. 

15 


Model  predictions  should  be  compared  with  human  data  to  tune  the  modeling 
effort.  The  model's  predictions  of  practice  data  should  be  compared  to  human  skill 
acquisition  data.  The  models  should  be  able  to  predict  the  entire  practice  function.  In 
any  modeling  erfort  of  this  type  there  are  many  parameters  and  possible  configurations 
within  the  architecture.  Empirical  data  are  needed  to  determine  the  appropriate 
constants  in  the  model.  It  Is  possible  that  physiological  data  may  provide  suggestions  as 
to  what  connective  patterns  to  explore  (see  e.g.,  Mishkin  &  Appenzeller,  1987).  The 
modeling  should  predict  practice  data  for  a  variety  of  tasks  and  training  procedures. 


16Schnelder  is  developing  a  software  library  to  run  simulations  within  connectlonlst/control  architecture 
on  EM  AT  or  PS2  computers.  The  programs  will  be  made  available  for  research  and  Instructional 
purposes  to  facilitate  exploration  of  the  modeling  space. 
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Summary 

Review  of  the  multi-task  training  and  the  part-whole  task  literature  shows  that 
performance  on  consistent  tasks  changes  dramatically  with  practice.  Single-task  training 
can  transfer  to  multi-task  performance;  however,  that  transfer  can  be  very  limited  and 
dual-task  training  produces  substantial  performance  improvement  even  after  extended 
single-task  training.  Existing  theoretical  frameworks  for  multi-task  performance 
generally  do  not  predict  the  observed  limited  single-  to  dual-task  transfer  effects. 

A  connectionlst/control  architecture  for  skill  acquisition  and  multi-task  training 
effects  provides  an  interpretation  of  the  limited  nature  of  single  to  dual-task  transfer. 
The  model  details  interactions  at  the  microlevel  (neural  like  interactions),  macrolevel 
(module  interactions),  and  the  system  level  (regions  of  processing  and  control  structures). 
The  model  identifies  five  phases  of  skill  acquisition  Including:  l)  controlled  comparison; 

2)  context-maintained  controlled  comparison;  3)  goal-state  maintained  controlled 
comparison:  4)  controlled-assist  of  automatic  processing;  and  5)  automatic  processing. 

As  a  skill  progresses  through  the  five  phases,  there  Is  a  qualitative  change  In 
processing  that  enables  performance  of  multiple  tasks.  During  the  controlled-comparlson 
phases  (Phases  1-3)  performance  Is  slow,  serial,  and  very  effortful.  During  Phase  l.  dual¬ 
task  time  sharing  Is  very  error  prone.  By  Phase  3,  dual  tasking  Is  possible  as  long  as  the 
tasks  are  accomplished  sequentially,  e.g..  performing  task  A,  then  task  B,  without 
overlapping  processes.  A  qualitative  shift  in  processing  occurs  by  Phase  4,  when 
associative  retrieval  of  the  response  substitutes  for  the  controlled  comparison  during 
Phases  1-3.  Performance  Is  fast,  parallel,  l.e.,  retrieval  Is  fairly  Independent  of  the 
number  of  comparisons,  requires  little  effort,  and  utilizes  little  attentlonal  or  innerloop 
processing.  Phase  4  skills  can  be  processed  concurrently  with  other  tasks  as  long  as  there 
Is  sufficient  controlled  processing  capacity  to  briefly  assist  the  transmission  of  messages 
and  to  transmit  messages  on  the  Innerloop.  A  Phase  5  skill  can  be  performed  reliably 
without  the  aid  or  controlled  processing  and  requires  only  small  periods  of  Innerloop  time 
for  transmitting  messages  from  Input  to  output  regions.  Phase  5  skills  (automatic 
processing)  can  be  performed  concurrently  as  long  as  the  two  tasks  do  not  require  the 
same  modules  and  can  time  share  innerloop  transmission  time  or  transmit  non-Interferlng 
messages. 

Seven  compensatory  activities  occur  In  the  model  during  multi-task  training  that 
either  do  not  appear  In  single-task  training  or  are  not  as  critical  In  single-task  situations: 
l)  task  shedding,  delay  and  buffer  pre-loading;  2)  letting  go  of  high  workload  strategies; 

3)  utilizing  noncompeting  resources;  4)  time  multiplexing;  5)  shortening  transmissions;  6) 
converting  interference  from  concurrent  transmissions;  and  7)  chunking  transmissions. 
The  development  of  these  compensatory  activities  provides  an  Interpretation  of  the  large 
practice  effects  served  In  dual-task  situations  even  after  extensive  single-task  training. 

The  connectionlst/control  architecture  model  of  skill  acquisition  and  multi-task 
performance  suggests  an  extensive  research  agenda  of  basic  and  applied  Issues  relating  to 


Dual-task  practice 


28  September  1087 


35 


skill  acquisition  for  bleb  workload  tasks.  It  emphasizes  that  models  or  dual-task 
performance  must  deal  with  issues  of  practice.  Researchers  should  not  ask  simply 
whether  single-task  training  transfers  to  dual-task  performance  or  whether  part-  versus 
whole-task  training  Is  better.  Rather,  research  should  map  out  quantifiable  performance 
variables  assessing  the  marginal  utility  of  practice  and  predicting  the  optimal  points  to 
shift  from  single-  to  multi-task  performance.  The  learning  and  capacity  of  compensatory 
activities  that  develop  during  dual-task  training  must  also  be  Investigated. 
Understanding  these  Issues  might  greatly  facilitate  development  and  use  of  part-task 
trainers  for  developing  high  performance  multi-task  skills. 

The  present  connectlonlst/control  architecture  is  Just  beginning  to  deal  with  the 
complexity  of  practice  effects  during  skill  acquisition.  We  need  to  enrich  the  set  of 
quantitative  models  we  have  available  to  understand  and  predict  skill  acquisition. 
Human  performance  changes  dramatically  with  practice,  thus  making  variables  critical 
Initially  become  less  Important  or  Irrelevant  after  practice.  Any  model  of  high  workload 
performance  that  does  not  deal  with  practice  effects  Is  at  best  an  approximation  to  the 
system,  that  through  practice,  produces  a  variety  of  knowledge  structures  and 
compensatory  activities  to  perform  multi-task  skills.  Allowing  basic  and  applied 
researchers  to  conceptualize  and  predict  the  effects  of  practice  will  advance  the 
understanding  of  skill  and  the  training  of  high  performance  skills. 
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Figure  Captions 

Figure  l.  Microlevel  structure  of  the  CAPl  simulation.  Processing  Is  assumed  to 
occur  In  networks  of  neural-llke  units.  Units  are  organized  into  modules  (the  box  labeled 
M3  outlines  the  third  module)  that  process  a  particular  class  of  Inputs.  Information 
between  modules  Is  transferred  as  a  message  vector  (MV)  on  fibers  connecting  the  output 
of  one  module  to  the  Input  of  the  next.  In  the  diagram  Information  flows  from  left  to 
right  (e  g.,  the  top  left  MV  might  encode  visual  features,  the  two  left  modules  letters, 
and  the  right  module  words).  Each  module  contains  a  vector  of  output  units.  The 
output  units  receive  input  from  other  modules  and  connect  autoassoclatlvely  to 
themselves.  The  recurrent  connections  from  the  bottom  of  each  output  unit  going  up 
and  connecting  to  the  other  output  units  In  the  same  module  represent  the 
autoassoclatlve  connection.  Each  of  the  crossing  points  above  the  output  units  (to 
message  vector  or  autoassoclatlve  fibers)  represents  an  associative  connection  that  can 
change  the  strength  of  connection  with  learning.  In  the  rest  of  the  diagram  the  reverse 
arrow-type  connections  represent  excitatory  Influences  and  the  flat  connections  represent 
Inhibitory  influences.  A  module's  output  Is  controlled  by  an  attenuation  unit  within  the 
module.  The  attenuation  unit  regulates  Information  flow  from  the  module.  Each 
module's  activity  is  regulated  by  a  control  structure  (the  box  labeled  C3  represents  the 
control  structure  for  the  third  module).  Each  module  reports  Its  activity  to  the  lower- 
level  control  structure  via  activity  report  and  priority  report  units.  The  lower  units 
(labeled  1.  2.  3)  Illustrate  a  potential  control  circuit.  Cell  1  receives  the  activity  reports 
from  the  module  and  inhibits  the  activity  of  neighboring  modules.  Cell  2  Inhibits  Cell  3. 
reducing  the  attenuation  activation,  thus  reducing  Inhibition  or  the  output  units,  and 
enabling  a  message  vector  to  transmit.  Cell  2  Is  assumed  to  habituate  resulting  In  a 
burst  of  output  and  sequential  switching  or  attention.  The  AP  box  In  M3  Illustrates  the 
local  circuit  for  automatic  processing.  For  an  automatic  process,  the  priority  report  unit 
Inhibits  the  attenuation  unit  and  causes  the  vector  to  transmit  from  the  module. 

Figure  2.  System-level  description  of  the  model.  The  top  portion  of  the  figure 
(2A)  shows  the  message  vector  connections  between  regions.  The  bottom  portion  (2B) 
shows  the  macrolevel  view  of  some  of  the  regions.  The  squares  and  rectangles  In  Figure 
2B  represent  the  modules  and  control  structures,  e.g.  Figure  l  module  M3,  control  C3. 
This  Is  a  top-down  view  of  the  regions  of  processing  within  the  system.  Each  region 
represents  a  series  of  processing  levels.  The  first  or  last  level  of  a  region  (last  level  for 
Input  regions  and  first  for  output  regions)  Is  assumed  to  Input  to  the  Innerloop  of 
connections  between  regions.  The  modules  on  the  Innerloop  have  separate  message 
vectors  to  each  of  the  other  modules  they  connect  to.  Ail  the  lines  In  Figure  2A 
represent  message  vectors  (as  MV  In  Figure  1).  Each  module  sends  a  message  vector  to 
all  the  other  modules  on  the  Innerloop.  The  output  for  the  visual  module  Is  highlighted 
to  Illustrate  this  connection  pattern.  This  figure  represents  a  simple  view  of  one  of  many 
possible  connection  patterns  for  regions  on  the  Innerloop.  Figure  2B  Illustrates  the 
processing  of  a  visual  word  to  produce  a  button  press.  The  first  level  of  the  visual  region 
processes  letters,  the  second  characters,  and  the  third  words.  The  message  Is  then 
transmitted  to  all  modules  on  the  Innerloop.  The  dotted  set  of  modules  represent  all  the 
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other  regions  on  the  lnnerloop  receiving  the  visual  message.  The  motor  regions  illustrate 
the  motor  output  of  the  system.  The  first  level  of  the  motor  system  stores  motor  tasks, 
e  g.  Kl  represents  the  code  for  pressing  the  first  key.  The  second  motor  level  codes  the 
sequences  required  to  execute  the  motor  task,  e.g  L-llft  finger,  M-move  to  position  of  key, 
and  P-press  key.  The  third  motor  level  Illustrates  the  components  of  the  lift  sequence, 
e.g.  A-accelerate  upward.  W-wait  some  time,  and  ^decelerate.  The  darker  horizontal 
lines  represent  message  vectors,  the  thin  lines  the  control  signals.  Each  module  (square) 
sends  an  ACTIVITY  report  to  Its  controller  (rectangle  below  it)  and  receives  a 
FEEDBACK  and  TRANSMIT  control  signal  (the  three  lines  from  the  squares  to  the 
rectangles).  Control  modules  exchange  LOAD  and  NEXT  signals  (diagonal  lines  between 
rectangles)  to  control  sequential  processing  between  modules.  For  modules  on  the 
lnnerloop,  the  Central  Control  exchanges  control  signals  for  ACTIVITY,  RESET, 
TRANSMIT.  NEXT  and,  LOAD  to  modulate  lnnerloop  message  traffic  (see  Schneider  & 
Detweller,  1987). 

Figure  3.  Timing  diagrams  of  the  system  as  a  function  of  phase  of  skill  acquisition 
in  a  category  search  task.  The  bottom  line  provides  the  time  scale  In  seconds.  Phases 
1-3  Involve  multiple  transmissions  and  monitoring  of  messages.  The  elevated  bars  in  line 
a  show  the  transmission  of  the  vectors  from  visual  module  l  (Vi)  or  2  (V2),  e.g  the 
transmission  of  “CAT*  as  In  Figure  2B.  The  two  visual  modules  would  alternate 
between  transmitting  two  words,  e.g.  "TOP*  from  Vi  and  'CAT*  from  V2.  Line  b 
shows  the  transmission  of  the  lexical  vectors  for  the  words  stored  in  the  lexical  buffers, 
e  g.  'FRUIT*  and  'ANIMAL'  of  Ll,  L2.  Line  c  shows  the  power  or  activity  report  of 
the  cells  in  the  semantic  module.  The  activity  increases  during  transmission  due  to 
summation  and  feedback  effects  within  the  module.  The  first  wave  Is  for  the  sum  of  the 
Vi  and  Ll  vectors,  e.g.  the  sum  of  the  semantic  codes  for  'TOP'  +  'FRUIT*.  If  the 
activity  Is  below  the  match  threshold,  the  next  pair  is  transmitted,  e.g.  'CAT'  + 
•FRUIT'.  When  the  visual  vectors  have  ail  been  sent,  attention  Is  switched  to  the  next 
lexical  module,  e.g.,  'ANIMAL',  the  visual  module  pointer  Is  reset  and  the  Vi  +  L2 
comparison  occurs.  This  continues  until  either  there  Is  a  match  or  the  last  comparison  Is 
complete.  The  last  wave  In  line  c  illustrates  a  match,  'CAT'  +  'ANIMAL',  l.e..  a  high 
activity  report.  After  a  match,  the  'YES'  response  Is  transmitted  from  the  motor 
module  (line  d)  and  initiates  a  motor  response.  Phnae  4  occurs  after  associations  are 
built  up  between  the  visual  Input  and  motor  response,  such  that  the  target  'CAT' 
assoclatlvely  evokes  the  'YES'  response  and  bypasses  the  semantic  match  process  found 
In  Phases  1-3.  Two  transmissions  are  still  needed  (line  e)  from  the  visual  to  the  motor 
region  to  prevent  the  Vi  and  V2  messages  from  interfering.  Line  f  shows  the  received 
motor  activity  from  the  visual  transmissions.  Line  £  shows  the  transmission  of  the  motor 
response  after  It  was  received.  The  vlsual-to-motor  associations  were  built  up  during 
Phases  1-3.  Phase  5,  automatic  processing,  occurs  after  priority  learning  occurs.  The 
vectors  with  a  high  priority  tag  are  transmitted  automatically  (dashed  pulse  line  h) 
without  the  need  for  attention.  This  transmission  evokes  the  response  (line  [)  that 
produces  an  automatic  response  transmission  (line  h). 

Figure  4.  Compensatory  activities  developed  during  dual-task  training.  The 
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rectangular  pulses  Indicate  transmitted  vectors,  the  waves  the  received  Information. 
Initially  transmitting  the  visual  (line  l)  and  auditory  (line  2)  vectors  In  parallel  results  in 
little  Information  reception  In  the  motor  (line  3)  or  speech  modules  (line  4).  Time 
multiplexing  the  signals  (lines  5  and  6)  results  In  greater  reception  with  some  delay  of  the 
second  message  (lines  7  and  8).  Dual-task  training  can  shorten  transmissions  (lines  9-12) 
and  convert  Interference  (lines  13-16)  (see  text  for  details). 

Figure  V  Chunking  transmissions  for  the  output  of  the  letters  "TV  •HV  *E*  as  In 
a  typing  task.  Initially  the  letters  are  transmitted  sequentially  from  three  visual  modules 
to  three  motor  modules  (lines  1-6).  After  Interference  conversion  occurs  (see  text),  all 
three  messages  can  be  transmitted  as  a  set  (lines  7-12).  If  the  visual  system  develops  a 
chunk  code  for  •THE*,  It  can  be  transmitted  from  a  single  visual  module  (line  13)  and 
evoke  a  motor  code  for  all  three  outputs  (Une  14).  Lines  15-22  show  transmissions  within 
the  motor  region  that  do  not  occur  on  the  lnnerloop  (see  Figure  2B  for  levels  In  the 
motor  region).  After  the  message  Is  received,  the  level  l  motor  module  transmits  a 
vector  (line  15)  to  the  level  2  modules  and  evokes  the  "TV  •HV  and  *E“  code  In  three 
modules  (lines  16-18),  decoding  the  •THE*  chunk  at  motor  level  1.  The  second  level 
sequentially  transmits  the  •TV  •HV  'E*  (lines  19-21)  and  evokes  the  letters  In  the  level 
3  module  (line  22).  The  level  three  module  then  decodes  the  movements  of  each  letter 
Just  as  the  level  2  module  decoded  the  letters  for  each  word.  When  the  level  3  module 
returns  a  NEXT  signal  (see  Figure  2B  and  text),  the  next  transmission  outputs  from  the 
level  2  modules. 
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